Regulatory Sequences for Pollen Specific or Pollen
Abundant Gene Expression in Plants



Description 

The present invention relates to isolated nucleic 
acid molecules functioning as regulatory elements, 
in particular promoters that drive pollen specific 
or pollen abundant expression in plants, in par- 
ticular in monocotyledonous plants, to vectors con- 
taining the regulatory elements, to host cells con- 
taining the vectors, to plants, harvest and propa- 
gation material containing the host cells, to meth- 
ods for obtaining them and to methods for isolating 
regulatory elements that contribute to
tissue-preferred gene expression in plants.

Gene expression is considered to comprise a number 
of steps from the DNA to the final protein product. 
Initiation of transcription of a gene is generally 
believed to be the predominant controlling factor 
in determining expression of a gene. The transcriptional
controls are generally located in relatively
short sequence elements embedded in the 5'-flanking
and/or 3'-flanking region of the transcribed
gene with which DNA-binding proteins may
interact. These DNA sequence elements serve to 
promote the formation of transcriptional complexes 
and eventually initiate gene expression processes. 
It is furthermore known that the regulation of gene 
expression often depends upon the developmental stage
and the tissue specificity of the cell concerned.
Thus, certain tissues of organisms such as plants 
may exhibit a metabolism and a protein composition 
different from other tissues of the plant or dif- 
ferent from the same tissue in a different develop- 
mental stage. 

Controlling the expression of genes in transgenic 
plants is considered to have high commercial value. 
The transfer of heterologous genes or genes of in- 
terest into a plant under control of tissue- 
specific regulatory elements provides a powerful 
means of conferring selective advantages to plants 
and of increasing their commercial value. The ability
to control gene expression is useful for conferring
resistance and immunity to certain diseases
or for modifying the metabolism of a tissue. Plant genetic
engineering techniques also prove useful in
generating improved plants for plant breeding pur- 
poses, such as male sterile plants, in particular 
plants exhibiting cytoplasmic (CMS) or nuclear 
(NMS) male sterility. 

In higher plants, such as maize, the male products
of meiosis, the microspores, undergo two mitotic 
cell divisions. The first pollen mitosis generates 
the large vegetative cell and a smaller generative 
cell. The generative cell undergoes a second mi- 
totic division forming the two sperm cells. 
Tricellular and bicellular pollen can be distin- 
guished, depending on whether the second pollen mi- 
tosis is performed in the pollen grain or in the 
pollen tube after germination. The vegetative cell 
forms the pollen tube upon germination and transports
the sperm cells through the transmitting
tract of the style to the embryo sac. Pollen tubes 
enter the embryo sacs through the micropyle and 
penetrate one of the two synergids whereupon the 
pollen tube bursts, releasing the sperm cells into 
the degenerating synergid. During the double fer- 
tilisation process one sperm cell fertilises the 
egg cell and the other sperm cell fertilises the 
central cell. Maize pollen tubes have to grow up 
to 30 cm along the style (silk) to reach the embryo 
sac which is deeply embedded within the ovary 
(kernel). Although maize pollen tubes reach a 
growth rate of approximately 1 cm/h, it takes several
hours to reach the embryo sac. During this
time pollen tubes have to be guided through the 
transmitting tissue of the style and new cell wall 
has to be synthesised to sustain the rapidly growing
tube tip. Two groups of pollen-expressed genes
are distinguished, the so-called early pollen 
genes, which are most abundant prior to the first 
pollen mitosis and the late pollen genes, which are 
thought to have a function during pollen maturation 
and tube growth. 

Several pollen-specific cDNAs have been isolated,
representing unknown proteins, cytoskeletal compo- 
nents, poly(A) binding proteins, translation ini- 
tiation factors, ubiquitin and others (reviewed by 
Taylor and Hepler, 1997). Several pollen specific 
nucleic acid sequences are described. It is known 
for instance that 5' UTRs (untranslated regions) of 
late pollen expressed genes from tomato are suffi- 
cient to alter translation efficiency in pollen 
(Bate et al., 1996; Curie and McCormick, 1997).
The pollen-specific translation initiation factor
eIF-4A8 was shown to be phosphorylated during pollen
tube growth and might be involved in regulating
translation of late pollen genes (op den Camp and
Kuhlemeier, 1998).

Furthermore, a pollen MADS box gene, DEFH125 of
Antirrhinum, is disclosed by Zachgo et al., 1997.
Immunolocalization studies showed increasing DEFH125
expression in maturing stamen and localisation to
the cytoplasm of the vegetative cell in mature pollen.
Finally, kinases have been described which are
expressed at different stages of pollen development 
(Lee et al., 1996; Estruch et al., 1994; Mu et al.,
1994; Li and Gray, 1997; Wilson et al., 1997) and
pollen tube growth (Muschietti et al., 1998), but
very little is known about transcriptional regula- 
tion of these complex processes. 

Thus, it is also considered particularly important 
to develop and provide transcriptional regulatory 
units that direct gene expression selectively to 
pollen tissue. Pollen preferred gene expression 
will provide several advantages to a plant, such as 
alteration of the function of the pollen tissue; 
modification of its growth rate; resistance to ad- 
verse weather conditions; as well as providing 
plants exhibiting male sterility, for instance by 
expressing toxic genes in pollen tissue. Pollen 
abundant or pollen specific gene expression would 
provide a mechanism according to which morphology 
and metabolism of pollen may be altered to produce 
useful plants.

Thus, there is a need to provide regulatory elements
capable of directing transcription specifically
in pollen tissue.

Thus, the technical problem underlying the present 
invention is to provide novel regulatory elements 
for use in cloning and expressing pollen specific 
or pollen abundant genes, in particular for use in 
monocotyledonous plants which provide a high expression
efficiency together with a high tissue
specificity.

The present invention solves this technical problem
by providing purified nucleic acid molecules
for use in cloning and
expressing a pollen specific or pollen abundant 
gene in a plant which are selected from the group 
consisting of 

(a) the nucleic acid sequence set out in any one of 
SEQ ID No. 3 to 12, or a complementary strand 
thereof, 

(b) a nucleic acid sequence which hybridises to the 
nucleic acid sequence set out in any one of SEQ ID 
No. 3 to 12, or a complementary strand thereof and 

(c) alleles or derivatives of the nucleic acid sequence
defined in (a) or (b) or a complementary
strand thereof.
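The "complementary strand" referred to in (a) to (c) can be derived mechanically from a given strand. The following Python sketch illustrates this, assuming plain A/C/G/T strings. The function name and the test sequences are hypothetical and are not taken from the sequence listing.

```python
# Hypothetical helper for illustration only; not part of the disclosure.
# Maps each base to its Watson-Crick partner, preserving letter case.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand of `seq`, written 5' to 3'."""
    # Complement every base, then reverse so the result reads 5' to 3'.
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Placeholder input; any SEQ ID No. 3 to 12 string could be used instead.
    print(reverse_complement("ATGC"))
```

Applying the function twice returns the input strand, which is a convenient sanity check.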

The sequences set out in SEQ ID No. 3 to 9 comprise 
isolated 5' regulatory elements and are considered 
to be promoters or parts thereof, i.e. promoter
fragments. These sequences are capable of modulating,
initiating and/or contributing to the pollen
specific or pollen abundant transcription of nu- 
cleic acid sequences operably linked to them. In a 
preferred embodiment of the present invention, 
these sequences may additionally contain at their 
3' terminus the nucleotide sequence from position 
+1 to +310 as depicted in SEQ ID No. 10. This se- 
quence is a 5' untranslated region, which may serve 
as linker between a promoter of the present inven- 
tion and a coding sequence linked thereto. 
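Positions such as +1 to +310 are counted relative to the transcription start, which is conventionally numbered +1 with no position 0. The following Python sketch shows how such a region could be cut out of a longer sequence string. It assumes that the 0-based index of the +1 nucleotide is known; the function name, the index and the placeholder sequence are illustrative assumptions and do not reproduce SEQ ID No. 10.

```python
def slice_relative(seq: str, tss_index: int, start: int, end: int) -> str:
    """Return positions start..end (inclusive), counted relative to +1.

    `tss_index` is the 0-based index in `seq` of the +1 nucleotide.
    Upstream positions are negative; position 0 does not exist.
    """
    def to_index(pos: int) -> int:
        if pos == 0:
            raise ValueError("there is no position 0 in this numbering")
        # +1 maps to tss_index, -1 maps to the nucleotide just before it.
        return tss_index + pos - 1 if pos > 0 else tss_index + pos

    i, j = to_index(start), to_index(end)
    if i < 0 or j >= len(seq) or i > j:
        raise ValueError("requested region lies outside the sequence")
    return seq[i:j + 1]


if __name__ == "__main__":
    # Placeholder: 5 upstream bases, a 310-base region, 3 trailing bases.
    demo = "a" * 5 + "B" * 310 + "c" * 3
    utr = slice_relative(demo, tss_index=5, start=1, end=310)
    print(len(utr))
```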

The sequences set out in SEQ ID No. 11 comprise 
isolated 3' regulatory elements and are considered 
to be 3' transcription regulatory elements, or 
parts thereof, for instance termination and/or 
polyadenylation signals. These sequences are capa- 
ble of modulating, in particular contributing to, 
or terminating the transcription of nucleic acid 
sequences operably linked to them. 

The sequence set out in SEQ ID No. 12 is a genomic 
clone comprising all of the above sequences. 

The present invention also relates to nucleic acid 
sequences which hybridise, in particular under 
stringent conditions, to the sequences set out in 
any one of SEQ ID No. 3 to 12. In particular, 
these sequences have a degree of identity of 70% to 
the sequence of SEQ ID Nos. 3 to 12.

In the context of the present invention, nucleic 
acid sequences which hybridise to any one of the 
specifically disclosed sequences of SEQ ID Nos. 3
to 12 are sequences which have a degree of 60% to
70% sequence identity to the specifically disclosed
sequence on nucleotide level. In an even more pre- 
ferred embodiment of the present invention, se- 
quences which are encompassed by the present inven- 
tion are sequences which have a degree of identity 
of more than 70%, and even more preferred, more 
than 80%, 90%, 95% and particularly 99% to the spe- 
cifically disclosed sequences on nucleotide level. 
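The identity thresholds named above can be made concrete with a short Python sketch. It assumes that the two sequences have already been aligned to equal length with "-" as gap character, and that identity is counted as matching columns divided by aligned columns. This is one common convention and is an assumption here; the description does not fix a particular alignment method. The function names are illustrative.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identity of two pre-aligned sequences ('-' marks a gap)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    # Ignore columns in which both sequences carry a gap.
    columns = [(x, y) for x, y in zip(a.upper(), b.upper())
               if not (x == "-" and y == "-")]
    if not columns:
        raise ValueError("no aligned columns")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def identity_tier(pct: float) -> str:
    """Map a percentage onto the tiers named in the text."""
    # "More than" thresholds, checked from the most to the least preferred.
    for threshold in (99, 95, 90, 80, 70):
        if pct > threshold:
            return f">{threshold}%"
    return "60-70%" if pct >= 60 else "<60%"


if __name__ == "__main__":
    pct = percent_identity("ACGTACGTAC", "ACGTACGTAA")
    print(pct, identity_tier(pct))
```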

Thus, the present invention relates to nucleic acid 
sequences, in particular DNA sequences which hy- 
bridise under the hybridisation conditions as de- 
scribed in Sambrook et al., (1989) in particular 
under the following conditions to the sequences 
specifically disclosed: 

Hybridisation buffer: 1 M NaCl; 1% SDS; 10% dextran sulphate; 100 µg/ml ssDNA
Hybridisation temperature: 65°C
First wash: 2 x SSC; 0.5% SDS at room temperature
Second wash: 0.2 x SSC; 0.5% SDS at 65°C.

More preferably, the hybridisation conditions are 
chosen as identified above, except that a hybridi- 
sation temperature and second wash temperature of 
68°C, and even more preferred, a hybridisation
temperature and second wash temperature of 70°C is
applied. 
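The two washes differ in salt concentration as well as in temperature. The following Python sketch makes the salt difference explicit. It assumes the standard laboratory recipe in which 1 x SSC corresponds to 0.15 M NaCl and 0.015 M trisodium citrate; that recipe is not stated in this description, and the function name is illustrative only.

```python
# Assumed standard composition of 1 x SSC (not given in the description).
SSC_1X_NACL_M = 0.15      # mol/l sodium chloride
SSC_1X_CITRATE_M = 0.015  # mol/l trisodium citrate


def ssc_molarities(fold: float) -> tuple[float, float]:
    """Return (NaCl, citrate) molarity for a given SSC concentration factor."""
    return fold * SSC_1X_NACL_M, fold * SSC_1X_CITRATE_M


if __name__ == "__main__":
    # Wash steps as listed in the hybridisation conditions.
    washes = {
        "first wash": (2.0, "room temperature"),
        "second wash": (0.2, "65°C"),
    }
    for name, (fold, temperature) in washes.items():
        nacl, citrate = ssc_molarities(fold)
        print(f"{name}: {fold} x SSC at {temperature}, "
              f"{nacl:.3f} M NaCl, {citrate:.4f} M citrate")
```

Under this assumption the second wash contains one tenth of the salt of the first wash and is carried out at a higher temperature, so it is the more stringent of the two steps.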

WO 01/12799
PCT/EP00/08002

Thus, the present invention also comprises nucleic
acid sequences which are functionally equivalent to
the sequences of any one of SEQ ID No. 3 to 12, in
particular sequences which have at least homology 
to the sequence of SEQ ID No. 3 to 12. The inven- 
tion also relates to alleles and derivatives of the 
sequences mentioned above which are defined as se- 
quences being essentially similar to the above se- 
quences but comprising, for instance, nucleotide 
exchanges, substitutions (also by unusual nucleo- 
tides), rearrangements, mutations, deletions, in- 
sertions, additions or nucleotide modifications and 
are functionally equivalent to the sequences set 
out in SEQ ID No. 3 to 12. 

The nucleic acid molecules of the present invention 
are, in a preferred embodiment, derived from maize 
(Zea mays), most preferably from maize pollen, in
particular from a gene called ZmMADS2.

The nucleic acid molecules of the present invention 
are useful for cloning tissue specific, in particu- 
lar pollen specific nucleic acid sequences, in par- 
ticular regulatory elements and/or genes, in 
plants, in particular in monocotyledonous plants. 
Thus, the present invention provides the means for 
isolation of transcription regulatory elements that 
direct or contribute to pollen-preferred gene ex- 
pression in plants, in particular in monocotyledon- 
ous plants, such as maize. The present invention 
also provides the means of isolating pollen spe- 
cifically expressed genes and their transcripts. 

The nucleic acid molecules of the present invention 
are also useful for expressing genes of interest in 
plants, in particular in the pollen of plants and 
especially in the pollen of monocotyledonous plants
such as maize or of dicotyledonous plants such as
sugar beets (Beta vulgaris). Thus, the present invention
provides the means to direct the expression
of a gene of interest in a tissue-specific or tis- 
sue-abundant manner in pollen. In a particularly 
preferred embodiment of the present invention, the 
promoter of the present invention is expressed in a 
spatially and temporally specific manner, preferably
in mature pollen after or during dehiscence,
but before dehydration of the pollen takes place. 
In a further preferred embodiment the promoter of 
the present invention is able, due to specific se- 
quence elements present in its sequence, not only 
to direct expression in preferably mature, ungermi- 
nated pollen tissue, but additionally also in root 
tips. Of course, by deleting or inactivating the 
root specific sequence element in the promoter of 
the present invention an exclusive pollen specific 
expression without any expression in other tissues 
of a plant may be obtained. Accordingly, the pro- 
teins encoded by the gene of interest can be accu- 
mulated in pollen. For instance, the promoter of 
the present invention is particularly useful in 
driving the pollen specific transcription of het- 
erologous structural genes that confer male steril- 
ity to transgenic plants by expressing exclusively 
in pollen mRNA or proteins distorting pollen devel- 
opment and/or function. Thus, the present inven- 
tion relates to improved means and methods for 
plant breeding, in particular the production of hy- 
brid seed, for instance by creating plants exhibit- 
ing nuclear or cytoplasmic male sterility, in par- 
ticular in grain, cereals and corn. In a particu- 
larly preferred embodiment, the present invention 
relates to a DNA construct with a promoter and/or
3' regulatory element of the present invention op- 
erably linked to a coding sequence for a toxic pro- 
tein, for instance a barnase. 

In the context of the present invention, a number 
of terms shall be utilised as follows. 

The term "promoter" refers to a sequence of DNA, 
usually upstream (5') to the coding sequence of a 
structural gene, which controls the expression of 
the coding region by providing the recognition for 
RNA polymerase and/or other factors required for 
transcription to start at the correct site. Pro- 
moter sequences are necessary, but not always suf- 
ficient to drive the expression of the gene. 

A "3' regulatory element" (or "3' end") refers to
that portion of a gene comprising a DNA segment, 
excluding the 5' sequence which drives the initia- 
tion of transcription and the structural portion of 
the gene, that determines the correct termination 
site and contains a polyadenylation signal and any 
other regulatory signals capable of effecting mes- 
senger RNA (mRNA) processing or gene expression. 
The polyadenylation signal is usually characterised 
by effecting the addition of polyadenylic acid 
tracts to the 3' end of the mRNA precursor. Poly- 
adenylation signals are commonly recognised by the 
presence of homology to the canonical form 5'- 
AATAAA-3', although variations are not uncommon. 
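Candidate polyadenylation signals can be located by scanning a sequence for the canonical hexamer while tolerating a small number of deviations. The following Python sketch illustrates this. The tolerance of one mismatch, the function name and the test sequences are assumptions chosen for illustration; the description only states that variations of the canonical form are not uncommon.

```python
CANONICAL_SIGNAL = "AATAAA"  # canonical form named in the text


def find_polya_signals(seq: str, max_mismatches: int = 1):
    """Return (position, hexamer, mismatches) for each near-canonical hit.

    Positions are 0-based indices into `seq`.
    """
    seq = seq.upper()
    width = len(CANONICAL_SIGNAL)
    hits = []
    for i in range(len(seq) - width + 1):
        window = seq[i:i + width]
        # Count positions at which the window deviates from AATAAA.
        mismatches = sum(1 for a, b in zip(window, CANONICAL_SIGNAL) if a != b)
        if mismatches <= max_mismatches:
            hits.append((i, window, mismatches))
    return hits


if __name__ == "__main__":
    # Placeholder 3' region; not taken from SEQ ID No. 11.
    for hit in find_polya_signals("GGAATAAAGG", max_mismatches=0):
        print(hit)
```

Setting max_mismatches to 0 restricts the search to exact matches of the canonical form.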

"Nucleic acid" refers to a large molecule which can 
be single or double stranded, composed of monomers
(nucleotides) containing a sugar, phosphate and either
a purine or pyrimidine. The nucleic acid may
be cDNA, genomic DNA, or RNA, for instance mRNA. 

The term "nucleic acid sequence" refers to a natu- 
ral or synthetic polymer of DNA or RNA which may be 
single or double stranded, alternatively containing 
synthetic, non-natural or altered nucleotide bases 
capable of incorporation into DNA or RNA polymers. 

The term "gene" refers to a DNA sequence that codes 
for a specific protein and regulatory elements con- 
trolling the expression of this DNA sequence. 

The term "regulatory element" refers to a sequence 
located upstream (5'), within and/or downstream 
(3') to a coding sequence whose transcription and
expression is controlled by the regulatory element, 
potentially in conjunction with the protein biosyn- 
thetic apparatus of the cell. "Regulation" or 
"regulate" refer to the modulation of the gene ex- 
pression induced by DNA sequence elements located 
primarily, but not exclusively upstream (5') from 
the transcription start of the gene of interest. 
Regulation may result in an all or none response to 
a stimulation, or it may result in variations in 
the level of gene expression. 

The term "coding sequence" refers to that portion 
of a gene encoding a protein, polypeptide, or a 
portion thereof, and excluding the regulatory se- 
quences which drive the initiation or termination 
of transcription.

The coding sequence or the regulatory element may
be one normally found in the cell, in which case it 
is called "autologous", or it may be one not nor- 
mally found in a cellular location, in which case 
it is termed "heterologous". 

A heterologous gene may also be composed of autologous
elements arranged in an order and/or orientation
not normally found in the cell into which it is
transferred. A heterologous gene may be derived in
whole or in part from any source known to the art, 
including a bacterial or viral genome or episome, 
eukaryotic nuclear or plasmid DNA, cDNA or chemi- 
cally synthesised DNA. The structural gene may 
constitute an uninterrupted coding region or it may 
include one or more introns bounded by appropriate 
splice junctions. The structural gene may be a 
composite of segments derived from different 
sources, naturally occurring or synthetic. 

The term "vector" refers to a recombinant DNA con- 
struct which may be a plasmid, virus, or autono- 
mously replicating sequence, phage or nucleotide 
sequence, linear or circular, of a single or double 
stranded DNA or RNA, derived from any source, in
which a number of nucleotide sequences have been 
joined or recombined into a unique construction 
which is capable of introducing a promoter fragment 
and a DNA sequence for a selected gene product in 
sense or antisense orientation along with appropriate
3' untranslated sequence into a cell, in particular
a plant cell.

As used herein, "plant" refers to photosynthetic
organisms, such as whole plants including algae, 
mosses, ferns and plant-derived tissues. "Plant 
derived tissues" refers to differentiated and un- 
differentiated tissues of a plant, including pol- 
len, pollen tubes, pollen grains, roots, shoots, 
shoot meristems, coleoptilar nodes, tassels, 
leaves, cotyledons, petals, ovules, tubers,
seeds, kernels and various forms of cells in cul- 
ture such as intact cells, protoplasts, embryos and 
callus tissue. Plant-derived tissues may be in 
planta, or in organ, tissue or cell culture. A 
"monocotyledonous plant" refers to a plant whose 
seeds have only one cotyledon, or organ of the embryo
that stores and absorbs food. A "dicotyledonous
plant" refers to a plant whose seeds
have two cotyledons.

"Transformation" and "transferring" refer to methods
to transfer DNA into cells including, but not
limited to, biolistic approaches such as particle
bombardment, microinjection, permeabilising the
cell membrane with various physical (e.g., electroporation)
or chemical (e.g., polyethylene glycol,
PEG) treatments; the fusion of protoplasts or Agro- 
bacterium tumefaciens or rhizogenes mediated trans- 
formation. For the injection and electroporation 
of DNA in plant cells there are no specific re- 
quirements for the plasmids used. Plasmids such as 
pUC derivatives can be used. If whole plants are 
to be regenerated from such transformed cells, 
there should be a selectable marker. Depending 
upon the method for the introduction of desired
genes into the plant cell, further DNA sequences
may be necessary; if, for example, the Ti or Ri
plasmid is used for the transformation of the plant
cell, at least the right border, often, however,
the right and left border of the Ti and Ri plasmid
T-DNA have to be linked as flanking region to the 
genes to be introduced. 

If Agrobacteria are used for the transformation, 
the DNA to be introduced has to be cloned into spe- 
cific plasmids, either into an intermediary vector 
or into a binary vector. The intermediary vectors 
can be integrated into the Ti or Ri plasmid of the 
Agrobacteria due to sequences that are homologous 
to sequences in the T-DNA by homologous recombina- 
tion. The Ti or Ri plasmid furthermore contains 
the vir region necessary for the transfer of the T- 
DNA into the plant cell. Intermediary vectors can- 
not replicate in Agrobacteria. By means of a helper 
plasmid the intermediary vector can be transferred 
by means of a conjugation to Agrobacterium tumefaciens.
Binary vectors can replicate both in E. coli
and in Agrobacteria and they contain a selection 
marker gene and a linker or polylinker framed by 
the right and left T-DNA border region. They can 
be transformed directly into the Agrobacteria 
(Holsters et al., 1978). The Agrobacterium serving
as a host cell should contain a plasmid carrying a 
vir region. The Agrobacterium transformed is used 
for the transformation of plant cells. The use of 
T-DNA for the transformation of plant cells has 
been extensively examined and described in EP-A 120
516; Hoekema, (1985); An et al., (1985).

For the transfer of the DNA into the plant cell,
plant explants can be co-cultivated with Agrobacte- 
rium tumefaciens or Agrobacterium rhizogenes. From 
the infected plant material (e.g., pieces of leaf, 
stem segments, roots, but also protoplasts or plant 
cells cultivated by suspension) whole plants can be 
regenerated in a suitable medium, which may contain 
antibiotics or biocides for the selection of transformed
cells.

Alternative systems for the transformation of mono- 
cotyledonous plants are the transformation by means 
of electrically or chemically induced introduction 
of DNA into protoplasts, the electroporation of 
partially permeabilised cells, the macroinjection 
of DNA into flowers, the microinjection of DNA into 
microspores and pro-embryos, the introduction of
DNA into germinating pollen and the introduction of 
DNA into embryos by swelling (Potrykus, Physiol. 
Plant (1990), 269-273). 

While the transformation of dicotyledonous plants 
via Ti plasmid vector systems with the help of 
Agrobacterium tumefaciens is well-established, more 
recent research work indicates that also monocoty- 
ledonous plants are accessible for transformation 
by means of vectors based on Agrobacterium (Chan et
al., (1993); Hiei et al., (1994); Bytebier et al.,
(1987); Raineri et al., (1990); Gould et al.,
(1991); Mooney et al., (1991); Li et al., (1992)).

In fact, several of the above-mentioned transformation
systems could be established for various cereals:
the electroporation of tissues, the transformation
of protoplasts and the DNA transfer by particle
bombardment in regenerative tissue and cells
(Jahne et al., (1995)). The transformation of wheat
has been frequently described in the literature
(Maheshwari et al., (1995)). The transformation of
maize has been described in Brettschneider et al.,
(1997) and Ishida et al., (1996).

The term "host cell" refers to a cell which has 
been genetically modified by transfer of a het- 
erologous or autologous nucleic acid sequence or 
its descendants still containing this sequence. 
The host cell may be transiently or stably trans- 
formed and is preferably able to express the trans- 
formed nucleic acid molecule. These cells are also 
termed "transgenic cells". In the case of an 
autologous nucleic acid sequence being transferred, 
the sequence will be present in the host cell in a 
higher copy number than naturally occurring. 

The term "operably linked" refers to the chemical 
fusion of two or more fragments of DNA in a proper
orientation such that the fusion preserves or cre- 
ates a proper reading frame, or makes possible the 
proper regulation of expression of the DNA se- 
quences when transformed into plant tissue. 
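One aspect of this definition, the preservation of a proper reading frame across a fusion, can be checked mechanically. The following Python sketch is a simplified illustration under stated assumptions: both fragments are coding DNA given 5' to 3', the upstream fragment starts in frame, and a fusion is accepted when the junction falls on a codon boundary and no stop codon occurs before the final codon. The function name and the test fragments are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fusion_in_frame(upstream_cds: str, downstream_cds: str) -> bool:
    """Return True if fusing the two coding fragments keeps the frame.

    Simplified check: the junction must lie on a codon boundary and no
    stop codon may occur before the last complete codon of the fusion.
    """
    if len(upstream_cds) % 3 != 0:
        # The downstream fragment would be read in a shifted frame.
        return False
    fused = (upstream_cds + downstream_cds).upper()
    usable = len(fused) - len(fused) % 3
    codons = [fused[i:i + 3] for i in range(0, usable, 3)]
    # A stop codon is tolerated only as the final codon.
    return not any(codon in STOP_CODONS for codon in codons[:-1])


if __name__ == "__main__":
    print(fusion_in_frame("ATGGCT", "GGTTAA"))
    print(fusion_in_frame("ATGGC", "TGGTTAA"))
```

The check covers only the reading-frame part of the definition; proper regulation of expression by a linked promoter cannot be verified from the sequence arithmetic alone.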

The term "expression" as used herein is intended to 
describe the transcription and/or coding of the se- 
quence for the gene product. In the expression, a 
DNA chain coding for the sequence of gene product 
is first transcribed to a complementary RNA, which 
is often an mRNA, and then the thus transcribed 
mRNA is translated into the above mentioned gene
product if the gene product is a protein. However,
expression also includes the transcription of DNA 
inserted in antisense orientation to its regulatory 
elements. Expression, which is constitutive and 
possibly further enhanced by an externally con- 
trolled promoter fragment thereby producing multi- 
ple copies of mRNA and large quantities of the se- 
lected gene product, may also include overproduc- 
tion of a gene product. 

A "tissue specific promoter" refers to a sequence 
of DNA that provides recognition signals for RNA 
polymerase and/or other factors required for tran- 
scription to begin, and/or for controlling expres- 
sion of the coding sequence precisely within cer- 
tain tissues or within certain cells of that tis- 
sue. Expression in a tissue specific manner may be 
only in individual tissues, or cells within tis- 
sues, or in combinations of tissues. The present 
invention relates in particular to pollen and/or 
root specific expression, i.e. examples may include 
tissue specific expression in pollen only and no 
other tissues within the plant, or may be in pollen 
and roots, and no other tissues of the plant. An 
expression in pollen according to which the expres- 
sion takes place mainly, but not exclusively, in 
the pollen is also termed "tissue abundant". 

The term "pollen specific nucleic acid sequence" 
refers to nucleic acid sequences, i.e. genes, cod- 
ing sequences and/or regulatory elements which are 
exclusively or mainly active in pollen of plants, 
in particular those which direct or contribute to a 
pollen abundant or pollen selective expression of a
protein. The term "pollen abundant nucleic acid
sequence" refers to nucleic acid sequences, i.e. 
genes, coding sequences and/or regulatory elements 
which are mainly active in pollen of plants, in 
particular those which direct or contribute to a 
pollen abundant expression of a protein. 

The present invention also relates to a vector com- 
prising the nucleic acid sequences according to the 
above, in particular to a bacterial vector, such as 
a plasmid, or a virus. 

In a particularly preferred embodiment of the pres- 
ent invention the nucleic acid sequences, i.e. the 
5' and/or 3' regulatory elements of the present invention
contained in the vector, are operably
linked to a gene of interest which in this context 
may also be only its coding sequence, which may be 
a heterologous or autologous gene or coding se- 
quence. Such a gene of interest may be a gene, in 
particular its coding sequence, conferring, for in- 
stance, disease resistance; drought resistance; in- 
sect resistance; herbicide resistance; immunity; an 
improved intake of nutrients, minerals or water 
from the soil; or a modified metabolism in the 
plant, particularly its pollen and/or roots. Such 
a modified metabolism may relate to a preferred ac- 
cumulation of useful or toxic substances in pollen 
and/or roots, for instance sugars or, vice versa, 
in the depletion of substances undesirable in pol- 
len and/or roots, for instance certain amino acids. 
Thus, in the context of the present invention, a 
gene of interest may confer resistance to infection 
by a virus, such as a gene encoding the capsid protein
of the BWYV or the BNYVV virus, a gene conferring
resistance to herbicides such as Basta® or
phosphinothricin, or to an insecticide, a gene conferring
resistance to the corn rootworm, a gene encoding
the toxic crystal protein of Bacillus
thuringiensis or a gene whose expression confers 
male sterility. A gene of interest includes also a 
coding sequence cloned in antisense orientation to 
the regulatory sequences directing its expression. 
Such an antisense-construct may be used specifi- 
cally to repress the activity of undesirable genes 
in plant cells, in particular in pollen and/or 
roots, for instance to produce male sterile plants 
exhibiting a modified development, for instance an 
abortive pollen development, pollen morphology,
metabolism and/or pollen function. The gene of interest
may also comprise signal sequences, in particular
ER targeting sequences, directing the encoded
protein in the ER and eventually for instance
in the cell wall, vascular tissue and/or the vacu- 
ole. Of course, the gene of interest may in a par- 
ticularly preferred embodiment be the coding region 
of the ZmMADS2 gene as depicted in SEQ ID No. 1 
coding a protein having the amino acid sequence de- 
picted in SEQ ID No. 2, or a part thereof may be 
operably linked in sense or antisense orientation 
to the regulatory element of the present invention, 
allowing for instance for the elimination of wild- 
type ZmMADS2 expression by antisense or co- 
suppression technology or overexpression of ZmMADS2 
protein. 

In a particularly preferred embodiment the vector 
defined above comprises further regulatory
elements directing or enhancing expression of the
gene of interest such as 5', 3' or 5' and 3' regulatory
elements known in the art. Regulatory elements
considered in the present invention also encompass
introns or parts of introns inserted within
or outside the gene of interest. 

In a particularly preferred embodiment of the present
invention the 3' regulatory element is a transcription
termination region, preferably a poly A
addition or polyadenylation sequence, most prefera- 
bly the poly A addition sequence of the NOS gene of 
Agrobacterium tumefaciens. 

Thus, the nucleic acid sequences of the present in- 
vention are useful since they enable the pollen 
and/or root specific expression of genes of interest
in plants, in particular monocotyledonous
plants. Accordingly, plants are enabled to produce 
useful, in particular toxic, products in their pol- 
len or the plants may be engineered by modifying 
the pollen structure, function and/or development. 

The present invention also relates to a method of 
genetically modifying a cell by transforming it 
with a nucleic acid molecule of the present inven- 
tion or vector according to the above, whereby the 
gene of interest operably linked to the nucleic 
acid sequence or sequences of the present invention 
is expressible in the cell. In particular, the 
cell being transformed by the method of the present 
invention is a plant, bacterial or yeast cell. In 
a particularly preferred embodiment of the present 
invention, the above method further comprises the
regeneration of the transformed cell to a differentiated
and, in a preferred embodiment, fertile or
non-fertile plant. 

The present invention also relates to host cells 
transformed with the nucleic acid molecule or the 
vector of the present invention, in particular 
plant, yeast or bacterial cells, in particular 
monocotyledonous or dicotyledonous plant cells. 
The present invention also relates to cell cul- 
tures, tissue, calluses, etc. comprising a cell ac- 
cording to the above, i.e. a transgenic cell and 
its descendants harbouring the nucleic acid mole- 
cule or vector of the present invention. 

Thus, the present invention relates to transgenic 
plant cells which were transformed with one or
several nucleic acid molecules of the present
invention as well as to transgenic plant cells
originating from such transformed cells. Such cells
contain one or several nucleic acid molecules of 
the present invention guaranteeing the transcrip- 
tion in plant cells, with it/them preferably being 
linked to a coding sequence in sense or antisense 
orientation. Such plant cells can be distinguished 
from naturally occurring plant cells by the obser- 
vation that they contain at least one nucleic acid 
molecule according to the present invention which 
does not naturally occur in these cells, or by the 
fact that such a molecule is integrated into the 
genome of the cell at a location where it does not 
naturally occur, that is, in another genomic re- 
gion, or by the observation that the copy number of 
the nucleic acid molecules is different from the
copy number in naturally occurring plants, in
particular a higher copy number. Preferably, the
nucleic acid sequences of the present invention are
operably linked to coding sequences of genes of in- 
terest or antisense DNA normally not found in the 
cell to be transformed, at least not in that order 
or orientation. 

Thus, the present invention also relates to trans- 
genic cells, also called host cells, transformed 
with the nucleic acid molecule or vector of the 
present invention, in particular plant, yeast or 
bacterial cells, in particular monocotyledonous or 
dicotyledonous plant cells. The present invention 
also relates to cell cultures, tissue, calluses, 
propagation and harvest material, seeds, seedlings, 
embryos, etc. comprising a cell according to the 
above, that is, a transgenic cell being stably or 
transiently transformed and being capable of ex- 
pressing a gene of interest or sequence under con- 
trol of the nucleic acid molecules of the present 
invention. The transgenic plants of the present 
invention can be regenerated to whole plants ac- 
cording to methods known to the person skilled in 
the art. The regenerated plant may be chimeric 
with respect to the incorporated foreign DNA. If 
the cells containing the foreign DNA develop into 
either micro- or macro-spores the integrated for- 
eign DNA will be transmitted to a sexual progeny. 
If the cells containing the foreign DNA are somatic 
cells of the plant, non-chimeric transgenic plants 
are produced by conventional methods of vegetative 
propagation either in vivo, that is from buds or
stem cuttings, or in vitro following established
procedures known in the art.

Thus, the present invention also relates to
transgenic plants, parts of plants, plant tissue,
plant seeds, plant embryos, plant seedlings, plant
propagation material, plant harvest material, plant
leaves, plant pollen and plant roots containing the
above identified plant cells of the present
invention. These plants or plant parts are
characterised by, as a minimum, the presence of the
heterologous transferred DNA construct of the
present invention in the genome or, in cases where
the transferred nucleic acid molecule is autologous
to the transformed host cell, by additional copies
of the nucleic acid molecule of the present
invention and/or a different location within the
genome. Thus, the present invention also relates to
plants, plant tissues, plant seeds, plant
seedlings, plant embryos, propagation material,
harvest material, leaves, pollen, roots, calluses,
tassels etc., non-biologically transformed, which
possess, stably or transiently integrated in the
genome of the cells, for instance in the cell
nucleus, plastids or mitochondria, a heterologous
and/or autologous nucleic acid sequence containing
a regulatory element of the present invention
recognised by the polymerases of the cells of the
said plant and, in a preferred embodiment, being
operably linked to a coding sequence of a gene of
interest or containing an antisense construct of
the above elements. The teaching of the present
invention is therefore applicable to any plant,
plant genus or plant species wherein the regulatory
elements mentioned above are recognised by the
polymerases of the cell. Thus, the present
invention provides plants of many species, genera,
families, orders and classes that are able to
recognise these regulatory elements of the present
invention or derivatives or parts thereof.

Any plant is considered, in particular plants of 
economic interest for example plants grown for hu- 
man or animal nutrition, plants grown for the con- 
tent of useful secondary metabolites, plants grown 
for their content of fibres, trees and plants of 
ornamental interest. Examples which do not imply 
any limitation as to the scope of the present in- 
vention are corn, wheat, barley, rice, sorghum, 
sugarcane, sugarbeet, soybean, Brassica, sunflower, 
carrot, tobacco, lettuce, cucumber, tomato, potato, 
cotton, Arabidopsis, Lolium, Festuca, Dactylis, or 
poplar. 

The present invention also relates to a process, in 
particular a microbiological process and/or techni- 
cal process, for producing a plant or reproduction 
material of said plant, including a heterologous
or autologous DNA construct of the present inven- 
tion stably or transiently integrated therein, and 
capable of being expressed in said plants or repro- 
duction material, which process comprises trans- 
forming cells or tissue of said plants with a DNA 
construct containing a nucleic acid molecule of the 
present invention, i.e. a regulatory element which 
is capable of causing the stable integration of 
said nucleic acid molecules in said cell or tissue 
and enabling the expression of an operably linked
further nucleic acid molecule in said plant cell or
tissue, regenerating plants or reproduction
material of said plant or both from the plant cell or
tissue transformed with said DNA construct and, op- 
tionally, biologically replicating said last men- 
tioned plants or reproduction material or both. 

Finally, the present invention relates to a method 
for isolating or cloning pollen and/or root
specific genes and/or pollen and/or root specific
regulatory elements, such as promoters, whereby a
nucleic acid sequence of the present invention is
used to screen nucleic acid sequences derived from 
any source, such as genomic or cDNA libraries de- 
rived from plants, in particular monocotyledonous 
plants. The nucleic acid sequences of the present 
invention thereby provide a means of isolating re- 
lated regulatory sequences of other plant species 
which confer pollen or root specificity to genes of 
interest operably linked to them. 

Further preferred embodiments of the present inven- 
tion are mentioned in the subclaims. 

The invention may be more fully understood from the 
following detailed sequence descriptions which are 
part of the present teaching. The SEQ ID Nos. 1 to 
40 are incorporated in the present invention. The 
positions indicated below refer to the sequence 
numbering of SEQ ID No. 12 in 5' to 3' direction. 

SEQ ID No. 1 represents the complete cDNA sequence
of the ZmMADS2 (Zea mays MADS-box) gene.



WO 01/12799                          PCT/EP00/08002



SEQ ID No. 2 represents the amino acid sequence of 
the ZmMADS2 protein. 

SEQ ID No. 3 to 9 are partial DNA sequences of the 
ZmMADS2 gene of Zea mays and represent promoters 
and promoter fragments of various lengths. 

SEQ ID No. 3: The 1502 bp (base pair) sequence
spans the region from and including position 1
towards the 3' end, up to and including position
1502.

SEQ ID No. 4: The 502 bp (base pair) sequence spans 
the region from and including position 1 towards 
the 3' end, up to and including position 502. 

SEQ ID No. 5: The 500 bp (base pair) sequence spans 
the region from and including position 503 towards 
the 3' end, up to and including position 1002. 

SEQ ID No. 6: The 1000 bp (base pair) sequence
spans the region from and including position 503
towards the 3' end, up to and including position
1502.

SEQ ID No. 7: The 740 bp (base pair) sequence spans 
the region from and including position 763 towards 
the 3' end, up to and including position 1502. 

SEQ ID No. 8: The 500 bp (base pair) sequence spans 
the region from and including position 1003 towards 
the 3' end, up to and including position 1502.

SEQ ID No. 9: The 410 bp (base pair) sequence
spans the region from and including position 1093
towards the 3' end, up to and including position
1502.

SEQ ID No. 10 is a partial DNA sequence of the
ZmMADS2 gene of Zea mays. The 310 bp sequence
spans the region from and excluding the translation
initiation codon ATG at position 1813 to 1815
towards the 5' end up to the initiation site of
transcription at position 1503 (5'-AAACG-3'). This
sequence is the 5' untranslated region (5' UTR).

SEQ ID No. 11 is also a partial DNA sequence of the 
ZmMADS2 gene and represents a 3' regulation element 
of 251 bp from and excluding the translation stop 
codon TGA at position 4379 to 4381 to position
5031.

SEQ ID No. 12 is the genomic DNA sequence of the
ZmMADS2 gene of Zea mays comprising 5031
nucleotides and encompassing the coding region of
the gene, a 5' untranslated region of 310 bp (5'
UTR: position 1503 to 1812) extending from and
including the nucleotide at position 1503 up to and
excluding the translation initiation codon ATG
(position 1813 to 1815) (also depicted in SEQ ID
No. 10) and the regulatory elements identified in
SEQ ID Nos. 3 to 9 and 11.
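
The fragment coordinates given above may be checked with a short script. The following Python sketch is illustrative only and is not part of the sequence listing. It assumes 1-based, inclusive positions on SEQ ID No. 12, takes the 1000 bp fragment spanning positions 503 to 1502 to be SEQ ID No. 6 (an assumption based on the order of the listing), and uses hypothetical helper names (`length_bp`, `extract`).

```python
# Fragment coordinates on SEQ ID No. 12 (1-based, inclusive), as stated in the text.
# Labelling the 503-1502 fragment as SEQ ID No. 6 is an assumption.
FRAGMENTS = {
    "SEQ ID No. 3": (1, 1502),
    "SEQ ID No. 4": (1, 502),
    "SEQ ID No. 5": (503, 1002),
    "SEQ ID No. 6": (503, 1502),
    "SEQ ID No. 7": (763, 1502),
    "SEQ ID No. 8": (1003, 1502),
    "SEQ ID No. 9": (1093, 1502),
    "SEQ ID No. 10": (1503, 1812),  # 5' UTR
}


def length_bp(start, end):
    """Length of a fragment given 1-based inclusive coordinates."""
    return end - start + 1


def extract(sequence, start, end):
    """Slice a fragment out of a sequence string using 1-based inclusive coordinates."""
    return sequence[start - 1:end]


for name, (start, end) in FRAGMENTS.items():
    print(f"{name}: positions {start}-{end}, {length_bp(start, end)} bp")
```

With these coordinates the computed lengths agree with the lengths stated for each fragment (1502, 502, 500, 1000, 740, 500, 410 and 310 bp).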

SEQ ID Nos. 13 to 40 represent primers used for
cloning and detecting nucleic acid sequences of the
present invention and/or transcripts expressed
thereby.

The invention is further illustrated by way of
example and the following drawings.

Figure 1 shows a northern blot analysis for various 
tissues of Zea mays. A signal was obtained exclu- 
sively in mature pollen. 

Figure 2 shows RT-PCR analyses of various maize
tissues. A signal could be obtained in mature
pollen and root tips (lower band; 350 bp).

Figure 3 shows PCR analyses of various reproductive
tissues, in particular of cDNA libraries of egg
cells, in vitro zygotes 18 hours after
fertilisation, mature pollen and leaves (from
seedlings). A signal was obtained only in mature
pollen.

Figure 4 shows RT-PCR analyses of ZmMADS2 expres- 
sion during microgametogenesis of maize. A signal 
was obtained in mature pollen after anthesis. 

Figure 5 shows ZmMADS2 transcript in growing pollen 
tubes of maize. 

Figure 6 shows transient transformation assays of 
N. tabacum. 






Example 1: Cloning of the ZmMADS2 cDNA sequence

Plant material and pollen isolation

Tissues were isolated from Zea mays L. inbred line
A188 (Green and Philips, 1975) cultivated in a
greenhouse. Embryos from kernels (12 dap and
mature) were isolated under sterile conditions.
Seedlings were germinated under sterile conditions
in the dark and were dissected into cotyledons,
root tips and scutella. For isolation of pollen
before anthesis, tassels were divided into upper
(mature stage) and lower parts (immature stage).
Pollen was isolated as described by Mordhorst et
al. (1993) and different developmental stages were
separated via a discontinuous Percoll gradient
(20%, 30% and 40% Percoll in 0.4 M mannitol).
Centrifugation was performed for 10 min at 20° C
and 226 x g in a swing-out rotor with slow
acceleration and deceleration. Developmental stages
of pollen were monitored microscopically and by
DAPI staining (see below).

RNA isolation and construction of cDNA libraries 

Total RNA was isolated from various tissues with
TRIzol (GibcoBRL). Sea sand was added for the
maceration of pollen. Total RNA was isolated from
mature pollen for the construction of a cDNA
library using the protocol described by Stirn et
al. (1995). The library was constructed from 5 µg
poly(A)+ RNA as outlined by Dresselhaus et al.
(1996b) using the Uni-ZAP XR lambda vector
(Stratagene). Total RNA from leaf material of
10 day old seedlings was isolated as described by
Logemann et al. (1987) and a cDNA library was
generated from 2 µg poly(A)+ RNA (seedlings
library).

Screening of cDNA library with maize MADS box
probes

The highly conserved MADS box of different maize
MADS box genes was amplified from the maize genome
by PCR and served as probes for the plaque
screening of a cDNA library of mature maize pollen.

Genomic DNA from leaf material was isolated as
outlined by Dellaporta et al. (1983) and served as
template for the synthesis of different MADS box 
probes. Gene specific primers with the nucleotide 
sequence specified in SEQ ID No. 13 to 24 were used 
to specifically amplify the MADS box region of 
maize MADS box genes: 

ZMM1 (5'-ATGGGGAGGGGAAGGATTGA-3', SEQ ID No. 13;
5'-CTGTTGTTGGCGTACTCGTAG-3', SEQ ID No. 14),
ZEM2/3/ZAG4 (5'-AGGGGCAAGATCGACATCAAG-3', SEQ ID
No. 15; 5'-GG/TCGT/AACTCGTAGAGGCGG-3', SEQ ID
No. 16),
ZAG3/5 (5'-ATGGGGAGGGGACGA/CGTTGA-3', SEQ ID
No. 17; 5'-GCTGCCGAACTCGTAGAGCT-3', SEQ ID No. 18),
ZAP1 (5'-GTTGTTGGCGTACTCGTAGAG-3', SEQ ID No. 19;
5'-GGGCGCAAGGTACAGCTGAA-3', SEQ ID No. 20),
ZAG1 (5'-GTTGTTGGCGTACTCGTAGAG-3', SEQ ID No. 21;
5'-AAGGGCAAGACTGAGATCAAG-3', SEQ ID No. 22) and
ZAG2 (5'-CACTTGAACTCTTTTACGCTTAT-3', SEQ ID No. 23;
5'-GACAATCTTGACACATGTATGAA-3', SEQ ID No. 24);
amplification of the MADS box and flanking genomic
regions: PCR amplification was performed with 200
ng genomic DNA in a standard reaction mixture: 250
nM primer, 2 mM MgCl2, 400 µM dNTPs and 1.25 U Taq
DNA polymerase (GibcoBRL) in PCR buffer (50 mM KCl,
20 mM Tris-HCl, pH 8.4). Hot start PCRs were
performed with the following profile: 5 min 95° C,
3 min 75° C (addition of Taq DNA polymerase)
followed by 30 cycles with 1 min 96° C, 1 min 62° C
(ZEM, ZMM, ZAG3) or 60° C (ZAG1, ZAP1) and 3 min
72° C. A final extension was performed for 5 min at
72° C. PCR products were separated on low melting
agarose gels (NuSieve GTG, Biozym) and isolated gel
fragments containing the DNAs were digested with
Gelase (Biozym). Probes were labelled with
[32P]-dCTP (6000 Ci/mmol, Amersham) using the
Prime-It II random primer labelling kit
(Stratagene) and purified with NucTrap columns
(Stratagene). Approximately 22,000 phages from the
pollen library were plated per 15 cm plate and
transferred to Hybond-N membranes (Amersham) as
double plaque lifts according to Sambrook et al.
(1989). Prehybridisation was performed with
50 ng/ml salmon sperm DNA in hybridisation buffer
(5x SSPE, 5x Denhardt's, 0.5% SDS) for 5 h at
55° C. Filters were hybridised with a cocktail of
the different MADS box probes in a final
concentration of 650,000 cpm for each probe/ml
hybridisation buffer. After hybridisation overnight
at 55° C, filters were washed three times for
15 min with 5x SSPE/0.1% SDS and exposed to X-Omat
AR films (Amersham) using intensifier screens at
-70° C. Putative positive lambda phages were
isolated and cDNAs excised according to the
manufacturer (ZAP-cDNA Synthesis Kit, Stratagene).
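
Two of the primers listed above (SEQ ID No. 16 and SEQ ID No. 17) contain positions written with a slash. Assuming that a notation such as "G/T" denotes a single position that is either G or T (an interpretation, not stated in the text), the individual primer species in each mixture may be enumerated as in the following Python sketch. The function names are hypothetical.

```python
from itertools import product


def parse_positions(primer):
    """Split a primer into positions; 'G/T' is read as one position (G or T).

    This reading of the slash notation is an assumption.
    """
    positions, i = [], 0
    while i < len(primer):
        if i + 2 < len(primer) and primer[i + 1] == "/":
            positions.append(primer[i] + primer[i + 2])  # two alternatives
            i += 3
        else:
            positions.append(primer[i])  # a single fixed base
            i += 1
    return positions


def expand(primer):
    """Return every concrete sequence encoded by a primer with slash positions."""
    return ["".join(bases) for bases in product(*parse_positions(primer))]


seq_id_16 = expand("GG/TCGT/AACTCGTAGAGGCGG")
seq_id_17 = expand("ATGGGGAGGGGACGA/CGTTGA")
print(len(seq_id_16), seq_id_16)
print(len(seq_id_17), seq_id_17)
```

Under this reading SEQ ID No. 16 is a mixture of four 19-mers and SEQ ID No. 17 a mixture of two 20-mers.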

Thus, approximately 250,000 phages were screened
with the MADS box probes at medium stringent
conditions to permit hybridisation to less
homologous sequences. Thirteen putative positive
signals were analysed further and two cDNAs with
high homology to MADS box proteins were isolated,
designated ZmMADS1 and ZmMADS2.
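
The scale of the screen follows from the figures given above. The short Python sketch below is a rough calculation only. It assumes that every plate carried the stated approximate number of phages and that all six probe types (ZMM1, ZEM2/3/ZAG4, ZAG3/5, ZAP1, ZAG1 and ZAG2) were present in the cocktail, neither of which is stated explicitly.

```python
import math

PHAGES_PER_PLATE = 22_000      # approximate, per 15 cm plate
PHAGES_SCREENED = 250_000      # approximate total screened
CPM_PER_PROBE_PER_ML = 650_000
N_PROBES = 6                   # assumption: one probe per primer pair listed

# Minimum number of plates needed to reach the stated total.
plates = math.ceil(PHAGES_SCREENED / PHAGES_PER_PLATE)

# Total radioactivity per ml of hybridisation buffer for the probe cocktail.
total_cpm_per_ml = CPM_PER_PROBE_PER_ML * N_PROBES

print(f"plates needed: {plates}")
print(f"total activity: {total_cpm_per_ml} cpm/ml")
```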

5' RACE 

ZmMADS2 cDNA isolated from the pollen cDNA library
was incomplete at its 5' end and a PCR approach was
chosen to isolate the full length cDNA from the
pollen library. For amplification of the missing
5' end, a gene specific RACE primer (rapid
amplification of cDNA ends,
5'-CGATTCAAATCGTGAATCTCAT-3'; SEQ ID No. 25)
combined with a pBluescript vector primer
(5'-CCCCCGGGCTGCAGGAATTC-3'; SEQ ID No. 26, vector
primer) was used in a standard PCR reaction with
the following profile: 3 min 96° C followed by 33
cycles: 20 sec 96° C, 30 sec 62° C, 2 min 72° C and
a final extension for 5 min at 72° C. PCR products
were cloned and sequenced. Full length cDNA was
amplified from 3 µl pollen cDNA library with
primers specific for the ZmMADS2 3'-end and 5'-end
(5'-TTTTAGCAACATCTGCACCATT-3'; SEQ ID No. 27;
5'-TCTCGGCTAGCTTCCTCCT-3'; SEQ ID No. 28) using a
mixture of 2 U Pfu DNA polymerase and 1 U TaqPlus
Long DNA polymerase (Stratagene) for a first
amplification round of 25 cycles in a standard PCR
reaction with Pfu reaction buffer (10 mM KCl, 10 mM
(NH4)2SO4, 2 mM MgSO4, 0.1% Triton X-100, 20 mM
Tris-HCl, pH 8.75). 1/10 of purified PCR products
served as template in a subsequent second
amplification round with Pfu DNA polymerase (25
cycles). PCR profiles for both amplification rounds
were as follows: 3 min 96° C, 5 min 75° C followed
by 25 cycles with 20 sec 96° C, 30 sec 58° C,
90 sec 72° C and a final extension for 5 min at
72° C. PCR products were cloned and sequenced.
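
The two cycling profiles described above may be written down as data, which makes it straightforward to compare them. The following Python sketch is an illustration under stated assumptions: the class and field names are hypothetical, and the computed time covers only the programmed hold steps, ignoring the ramping between temperatures.

```python
from dataclasses import dataclass


@dataclass
class Profile:
    """A thermocycler programme; each step is (seconds, temperature in deg C)."""
    initial: list
    cycles: int
    cycle_steps: list
    final: list

    def hold_seconds(self):
        """Sum of all programmed hold times (ramping time is not included)."""
        once = sum(s for s, _ in self.initial) + sum(s for s, _ in self.final)
        per_cycle = sum(s for s, _ in self.cycle_steps)
        return once + self.cycles * per_cycle


# 5' RACE: 3 min 96, then 33 x (20 s 96, 30 s 62, 2 min 72), 5 min 72
race = Profile([(180, 96)], 33, [(20, 96), (30, 62), (120, 72)], [(300, 72)])

# Full length cDNA: 3 min 96, 5 min 75, 25 x (20 s 96, 30 s 58, 90 s 72), 5 min 72
full_length = Profile([(180, 96), (300, 75)], 25,
                      [(20, 96), (30, 58), (90, 72)], [(300, 72)])

for name, profile in (("5' RACE", race), ("full length", full_length)):
    print(f"{name}: {profile.hold_seconds() / 60:.1f} min of hold time")
```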

DNA Sequencing and sequence analysis 

Sequencing of cDNAs was performed with the ABI
PRISM Dye Terminator Kit with TaqFS DNA polymerase
(PE Applied Biosystems) according to the
manufacturer's protocol, except that 800 ng of
template DNA and 5 pmole vector primers were used.
Sequence analyses were performed with the DNASIS
1.1 software program package (HITACHI).

The cDNA of ZmMADS2 is 1268 bp in length with an
open reading frame of 240 amino acids. The 3'-UTR 
is 251 bp (3' UTR: position 3976 - 4226) and the 
5'-UTR is 310 bp (5' UTR: position 1503 - 1812) in 
length, calculated from the putative start point of 
transcription at position 1503 as analyzed by 
primer extension (data not shown) to and excluding 
the ATG at position 1813-1815. 

The sequence of the full-length cDNA is given in
SEQ ID No. 1. The amino acid sequence of ZmMADS2
is given in SEQ ID No. 2. ZmMADS2 contains a
MADS-box at the N-terminal end consisting of 57
amino acids (including the start methionine). The
MADS-box is followed by a linker region of 32 amino
acids and a K-box comprising 67 amino acids. A
putative bipartite nuclear localisation signal is
located in the MADS box of ZmMADS2. A bipartite
signal motif (RR-(X)12-KRR) is comprised of two
basic amino acids, a spacer of twelve variable
amino acids and a basic cluster in which three out
of five amino acids are basic. Putative
N-glycosylation sites (N-X-T/S) and several
putative phosphorylation sites for protein kinase C
(ST-X-RK) and casein kinase II (ST-X2-DE) have been
found. Homology searches revealed that ZmMADS2
belongs to the AGL17 family of MADS box proteins
(Zachgo et al., 1997).
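
The domain sizes and motif definitions given above may be expressed as a short script. The Python sketch below is illustrative only. It assumes that the MADS-box, linker and K-box follow one another without gaps from residue 1 and that the remainder of the 240 amino acids forms a C-terminal region. The motif patterns are transcribed from the text, the function names are hypothetical, and the test peptide is invented and is not taken from SEQ ID No. 2.

```python
import re

# Motif patterns as written in the text, translated into regular expressions.
PATTERNS = {
    "N-glycosylation": "N.[ST]",      # N-X-T/S
    "protein kinase C": "[ST].[RK]",  # ST-X-RK
    "casein kinase II": "[ST]..[DE]", # ST-X2-DE
}


def layout(segments, total):
    """Assign 1-based residue ranges to consecutive segments (assumed contiguous)."""
    pos, out = 1, []
    for name, size in segments:
        out.append((name, pos, pos + size - 1))
        pos += size
    if pos <= total:
        out.append(("C-terminal region", pos, total))
    return out


def find_sites(seq, pattern):
    """1-based start positions of all matches, including overlapping ones."""
    return [m.start() + 1 for m in re.finditer(f"(?=({pattern}))", seq)]


def has_bipartite_nls(seq):
    """Two basic residues, a 12-residue spacer, then >= 3 basic residues out of 5."""
    basic = set("KR")
    for i in range(len(seq) - 18):
        w = seq[i:i + 19]
        if w[0] in basic and w[1] in basic and sum(c in basic for c in w[14:]) >= 3:
            return True
    return False


domains = layout([("MADS-box", 57), ("linker", 32), ("K-box", 67)], total=240)
print(domains)

toy_peptide = "AANGTSARKSSDE"  # invented example sequence
for name, pattern in PATTERNS.items():
    print(name, find_sites(toy_peptide, pattern))
```

Under the contiguity assumption the K-box ends at residue 156, leaving 84 residues for the C-terminal region.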

Example 2: Cloning of the genomic ZmMADS2 clone

The cDNA isolated in Example 1 was used to clone 
the genomic clone as follows: 

Genomic DNA from leaf material of Zea mays line
A188 was used to construct libraries with the
"Universal Genome Walker Kit" (Clontech, Palo
Alto). These libraries served as templates for the
synthesis of four genomic fragments. The
amplification was carried out with nested adapter
primers delivered by the manufacturer and nested
gene specific primers (Tnorf1:
5'-CGGCCTATAGCTAGCTCTCTTCTTGACCCT-3', SEQ ID No. 29;
Front1: 5'-AGGGTCAAGAAGAGAGCTAGCTATAGG-3', SEQ ID
No. 30; Tnorf2: 5'-GCTAAGGAGCGAGAGGTTGTGGTTGTGG-3',
SEQ ID No. 31; Front2:
5'-CCACAACCACAACCTCTCGCTCCTTA-3', SEQ ID No. 32)
according to the manufacturer's instructions.

PCR amplification of a fragment containing intron 1
and the first 247 bp of intron 2 was performed
following the "Genome Walker" protocol. For
amplification of the promoter fragments, 5% DMSO
was added to the reaction mixture. Synthesis of the
sequences of introns 3 to 6 was performed in a
standard PCR reaction mixture with 200 ng genomic
DNA template and the following gene specific
primers:
Ex3_oben: 5'-TCGGCAGTTGACGGGAGAT-3', SEQ ID No. 33,
In3'_low: 5'-TTAGCAACTCATTATAGCAC-3', SEQ ID No. 34,
Pe_for: 5'-AGAAACCAGAGATGTTCCAG-3', SEQ ID No. 35,
Pe-rev: 5'-CAACATCTGCACCATTTTGAA-3', SEQ ID No. 36,
Intron2_1: 5'-GTCACATCAACAAGCGTCCTACC-3', SEQ ID
No. 37, Intron3: 5'-TAAGAGCATCAACAGTAGTAACG-3',
SEQ ID No. 38.

All fragments were cloned with the Zero Blunt TOPO
PCR Cloning Kit (Invitrogen, Leek) and sequenced at
SEQLAB (Göttingen), except for fragments containing
intron 2. Amplification of intron 2 with different
primer pairs (Intron2_1 and Pe-rev, Intron2_1 and
In3'_low, Intron2_1 and Intron3, Front2 and
Intron3) using different DNA polymerases (PfuTurbo,
Stratagene; PlatinumTaq, GibcoBRL; AdvantageTth,
Clontech) showed that this intron is about 11 kb in
length.

The nucleotide sequence of the genomic clone as
depicted in SEQ ID No. 12 comprises 5031
nucleotides. Intron No. 2 has not been fully
sequenced. Depending upon the start of
transcription, the promoter comprises nucleotides 1
to 1502, since a transcription start point has been
determined at position 1503 (5'-AAACGG-3'). From
position 1503 to the ATG translation initiation
codon at position 1813 to 1815 is the 5'
untranslated region. Exons and consequently also
introns are indicated in SEQ ID No. 12. In intron
II between position 3203 and 3204 there remains
some sequence ambiguity. Further nucleotides are
present in intron II at this position (circa 11
kb).

The translation termination codon TGA is located at 
position 4778 to 4780 with a transcription termina- 
tion signal between and including position 4781 and 
5031. A putative polyadenylation site TATAA is lo- 
cated at position 4888 to 4893. 

The promoter from position 1 to position 1502 de- 
picted in SEQ ID No. 3 may be divided into func- 
tional promoter fragments with sequences as indi- 
cated in SEQ ID No. 4 to 9. SEQ ID No. 4 comprises 
at position 17 to 27 an induction element respon- 
sive to dryness, abscisic acid and/or coldness and 
at position 151 to 155 a root-specific element. 
The promoter fragment of SEQ ID No. 5 comprises at
positions 532 to 536, 542 to 546, 634 to 638 and
705 to 709 root-specific elements, at position 654
to 658 an element responsible for induction by
sugar depletion and at position 684 to 689 an
element responsive to dryness, abscisic acid and/or
coldness.

The promoter set out in SEQ ID No. 8 comprises
elements responsible for induction by dryness,
abscisic acid and/or coldness located at positions
1019 to 1025 and 1369 to 1375, a root-specific
element at position 1083 to 1087 and a sucrose
responsive element at position 1192 to 1200, as
well as an element responsible for induction by
sugar depletion at position 1259 to 1263. The
element at position 1083 to 1087 is particularly
important because it confers root-specificity.
Deletion of this element in the promoter fragments
and promoters of the present invention provides a
promoter driving expression exclusively in pollen.
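
Which promoter fragments contain a given element follows directly from the coordinates. The Python sketch below is a bookkeeping aid only and makes no statement about promoter activity. It assumes 1-based inclusive positions on SEQ ID No. 12, takes the fragment spanning positions 503 to 1502 to be SEQ ID No. 6 (an assumption), and lists only the root-specific elements named above for the promoter region. The function names are hypothetical.

```python
# Promoter fragments (SEQ ID No. -> positions on SEQ ID No. 12, inclusive).
FRAGMENTS = {
    3: (1, 1502), 4: (1, 502), 5: (503, 1002), 6: (503, 1502),
    7: (763, 1502), 8: (1003, 1502), 9: (1093, 1502),
}

# Root-specific elements in the promoter region, as listed in the text.
ROOT_ELEMENTS = [(151, 155), (532, 536), (542, 546),
                 (634, 638), (705, 709), (1083, 1087)]


def fragments_containing(element):
    """SEQ ID numbers of all fragments that fully contain the element."""
    e_start, e_end = element
    return [sid for sid, (s, e) in FRAGMENTS.items()
            if s <= e_start and e_end <= e]


def root_elements_in(seq_id):
    """Root-specific elements lying entirely inside the given fragment."""
    s, e = FRAGMENTS[seq_id]
    return [el for el in ROOT_ELEMENTS if s <= el[0] and el[1] <= e]


print(fragments_containing((1083, 1087)))
for sid in FRAGMENTS:
    print(f"SEQ ID No. {sid}: {root_elements_in(sid)}")
```

On these coordinates the fragment starting at position 1093 is the only one that contains none of the listed root-specific elements.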

In the 5' UTR depicted in SEQ ID No. 10, an element 
responsible for induction by sugar depletion is lo- 
cated at position 1701 to 1708. 

Further elements conferring root-specificity are
located at the following positions: 2281 to 2286, 
2518 to 2522, 2575 to 2579, 2802 to 2807, 3014 to 
3018, 4295 to 4299, 4506 to 4510, 4700 to 4704, 
4888 to 4892. 

Further elements responsible for induction by dry- 
ness, abscisic acid and/or coldness are located at 
the following positions: 2014 to 2019, 2055 to 
2060, 2072 to 2077, 2392 to 2397, 2584 to 2589, 
2679 to 2683, 2842 to 2846, 2863 to 2869, 3112 to 
3117, 3231 to 3236, 4067 to 4072, 4228 to 4233, 
4242 to 4246, 4344 to 4350, 4479 to 4484, 4486 to
4492.

Example 3: Northern blot and PCR analyses

Ten µg of total RNA extracted from various tissues
were separated on denaturing agarose gels and
transferred to Hybond N+ membranes (Amersham) by
capillary blotting with 10x SSC overnight. The RNA
was fixed to the membrane by UV crosslinking with
300 mJoule in a Stratalinker 1800 (Stratagene).
Gene specific probes were amplified from plasmids
containing ZmMADS2 cDNAs with primers specific for
the 3'-ends of ZmMADS2 (5'-AGAAACCAGAGATGTTCCAG-3';
SEQ ID No. 39, 5'-CAACATCTGCACCATTTTGAA-3'; SEQ ID
No. 40) according to the profile described for the
5' RACE. Probes were labelled as described above.
Prehybridisation was performed for 5 h at 65° C
with 100 µg/ml salmon DNA in hybridisation buffer
(7% SDS, 1 mM EDTA, 0.5 M NaH2PO4, pH 7.2). Probes
were added in a final concentration of lO' cpm/ml 
hybridisation buffer and hybridised overnight at 
65° C. Filters were washed with decreasing concen- 
trations of SSC with a final wash of 0.2xSSC/0.1% 
SDS for 15 min at 65° C and were exposed to X-Omat 
AR films (Amersham) at -70° C using intensifier 
screens. Fig. 1 shows the spatial and temporal ex- 
pression of ZmMADS2 (dag: days after germination, 
dap: days after pollination). It is shown that the 
ZmMADS2 coding sequence is expressed in pollen, in 
particular in mature pollen after dehiscence indi- 
cating an essential role in pollen development and 
pollen function, in particular pollen tube growth. 
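
The SSC concentrations used for blotting and washing may be converted into molar terms. The following Python sketch assumes the commonly used recipe in which 20x SSC is 3 M NaCl and 0.3 M sodium citrate. This composition is not given in the text and the function names are hypothetical.

```python
# Assumed composition of a 20x SSC stock (standard recipe, not from the text).
SSC_20X_MOLAR = {"NaCl": 3.0, "sodium citrate": 0.3}


def ssc_mM(fold):
    """Millimolar concentrations of each component in 'fold' x SSC."""
    return {name: round(molar * fold / 20 * 1000, 6)
            for name, molar in SSC_20X_MOLAR.items()}


def stock_volume_ml(final_volume_ml, fold):
    """Volume of 20x stock needed to prepare final_volume_ml of 'fold' x SSC."""
    return round(final_volume_ml * fold / 20, 6)


for fold in (10, 0.2):  # blotting buffer and final wash
    print(f"{fold}x SSC: {ssc_mM(fold)}")
print(f"20x stock for 500 ml of 0.2x SSC: {stock_volume_ml(500, 0.2)} ml")
```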

RT-PCR analyses of tissues for ZmMADS2 expression
were performed with 1 µg of total RNA (Figure 2).
RNA was reverse transcribed with 50 U SuperScript II
reverse transcriptase (GibcoBRL) for 75 min at
50° C with the ZmMADS2 specific primer (see above)
in reaction buffer (75 mM KCl, 3 mM MgCl2, 50 mM
Tris-HCl, pH 8.3) with 0.4 mM dNTPs, 10 mM DTT and
10 U RNasin (Promega). The complete RT-PCR reaction
was used as template in the subsequent standard PCR
reaction with ZmMADS2 specific primers (see above).
PCR products were separated on an agarose gel,
blotted and hybridised to radiolabelled ZmMADS2
specific probe. Figure 2 shows ZmMADS2 expression
in mature pollen and root tips (lower band).

For quantitative RT-PCR analysis of ZmMADS2
expression in pollen, RNA samples were treated with
DNase I according to the manufacturer's protocol
(GibcoBRL). 250 ng and 500 ng, respectively, of
each RNA sample were reverse transcribed and a
standard PCR was performed with 27 cycles (ZmMADS2)
as described above (Figure 4). cDNA libraries of
maize egg cells, in vitro zygotes (Dresselhaus et
al., 1994 and 1996a), pollen and seedlings were
analysed for the presence of ZmMADS2 cDNAs with
gene-specific primers using 3 µg of cDNA libraries
as templates in a standard PCR as described above.
To confirm specificity of PCR products, gels were
blotted and hybridised to radiolabelled
gene-specific probes (Fig. 3). Fig. 3 shows that
among various reproductive cells ZmMADS2 is only
expressed in pollen.

Thus, it was shown that ZmMADS2 is most abundant in 
mature pollen after dehiscence. At that stage pol- 
len are dehydrated and metabolically inactive. The 
onset of ZmMADS2 expression therefore takes place 
after the pollen reaches maturity, but before
dehydration inhibits transcriptional activity
(Figure 4). A faint signal was detected in root tips (Fig.
2) which is confirmed by RT-PCR analyses as shown 
in Fig. 2. The lower band represents amplified 
ZmMADS2 cDNA which is detectable in mature pollen 
and root tips. The upper band represents genomic 
DNA.

In order to localise ZmMADS2 transcripts, whole
mount RNA in situ hybridisation experiments were
performed.

Example 4: Whole mount in situ hybridisation and DAPI staining

Pollen was germinated in liquid medium as described 
by Torres et al . (1995). Germination rates were 
determined microscopically and only samples with 
high germination rates were used for the experi- 
ments. All media were prepared with DEPC-treated 
deionized water. Germinated pollen were fixed for 
2 to 3 h in 4% paraf ormaldehyd in germination me- 
dium at room temperature. Fixation and all follow- 
ing steps were performed in Eppendorf tubes and 
pollen was precipitated by centrif ugation for 1 min 
at 1500 rpm at room temperature for the exchange of 
medium. Pollen was dehydrated by passage through a 
series of 3:7, 1:1, 7:3 ME : PP solutions at room 
temperature for 30 min each (ME: 90% methanol and 
10% 0.5 M EGTA; PP : 4% paraformaldehyde in PBS 
(phosphate buffered saline, 130 mM NaCl, 10 mM 
NaHjPO,, pH 7.4) . Pollen was incubated for 10 min 
in 100% ethanol and for 10 min in 100% xylol before 
rehydration through an ethanol series was performed 
(85%, 70%, 55%, and 35% ethanol in water).
Proteinase K (Boehringer) treatment was carried out
for 35 min at 37°C with 1 µg/ml proteinase in 100 mM
Tris-HCl (pH 7.5) with 50 mM EDTA. Reactions were
stopped with 2 mg/ml glycine in PBS (2 min) and
pollen was washed in PBS (2 min). Pollen was
re-fixed for 20 min at room temperature with 4%
paraformaldehyde in PBS and washed twice for 5 min
with PBS. Prehybridisation was performed with
100 µg/ml tRNA and 100 µg/ml mRNA (Boehringer) in
1 ml hybridisation buffer (6x SSC, 0.1% SDS, 50%
formamide) at 42°C for 3 h with gentle shaking.
After prehybridisation, 800 µl of the buffer was
removed and the remaining 200 µl were distributed
to two 500 µl Eppendorf tubes. Probes were added
to a final concentration of 500 ng/ml.
DIG-labelled RNA probes were synthesised from the 3'
end of ZmMADS2 cloned into pGEM-T vector (Promega)
with Sp6 and T7 RNA polymerase (Boehringer)
according to the manufacturer's protocol.
Hybridisation was carried out at 42°C overnight
without shaking. The samples were washed for 10 min
at 42°C with 2x SSC, 0.1% SDS with gentle shaking
before RNase treatment with 40 µg/ml RNase A
(Boehringer) was performed for 30 min at 37°C in
150 µl RNase buffer (10 mM Tris-HCl (pH 8), 0.5 M
NaCl, 1 mM EDTA). Samples were washed three times
for 10 min in 1x SSC, 0.1% SDS and 0.5x SSC, 0.1%
SDS at 42°C and equilibrated for 5 min in B1 buffer
(Boehringer). Blocking was performed in B1 buffer
with 1% blocking reagent (Boehringer) and 0.3%
Triton X-100 for 1 h at room temperature. Detection
was performed with NBT/BCIP according to the
manufacturer's protocol (Boehringer). The reactions
were monitored microscopically and stopped by adding
buffers at neutral pH (Tris-HCl or B1 buffer).
Reactions could be restarted by retransferring the
probes into reaction mixture.
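
The probe addition step is a simple dilution calculation. The following minimal Python sketch works it through under two assumptions that are not stated in the text: that the remaining 200 µl is split evenly into two 100 µl aliquots, and that the probe stock has a hypothetical concentration of 100 ng/µl. The function names are illustrative only.

```python
def probe_mass_ng(final_ng_per_ml: float, volume_ul: float) -> float:
    """Mass of probe (ng) contained in volume_ul at the given final concentration."""
    return final_ng_per_ml * (volume_ul / 1000.0)


def stock_volume_ul(final_ng_per_ml: float, volume_ul: float,
                    stock_ng_per_ul: float) -> float:
    """Stock volume (µl) to add, ignoring the small volume increase it causes."""
    return probe_mass_ng(final_ng_per_ml, volume_ul) / stock_ng_per_ul


# Assumption: the remaining 200 µl is divided evenly, i.e. 100 µl per tube.
TUBE_VOLUME_UL = 200.0 / 2
# Assumption: hypothetical probe stock concentration (not given in the text).
STOCK_NG_PER_UL = 100.0

mass = probe_mass_ng(500.0, TUBE_VOLUME_UL)
volume = stock_volume_ul(500.0, TUBE_VOLUME_UL, STOCK_NG_PER_UL)
print(f"{mass:.1f} ng probe per tube, {volume:.2f} µl of stock")
```

With a different stock concentration only STOCK_NG_PER_UL needs to be changed; the probe mass per tube follows from the stated final concentration of 500 ng/ml alone.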

DAPI staining was performed with pollen fixed in
ethanol:acetic acid (3:1) for 1 h at room
temperature and rehydrated through an ethanol series
(75%, 55%, 35% ethanol in water). Pollen was
transferred to Tris-HCl buffer (pH 7.5) with
200 ng/ml DAPI and staining was monitored under UV
light (359/441 nm).

The in situ hybridisation experiments with ZmMADS2
antisense probes showed transcripts in the emerging
pollen tube 5 min after germination (Figure 5a).
As a control, Figure 5b shows that a sense probe
does not detect transcripts in the emerging pollen
tube. During pollen tube growth, transcripts are
translocated into pollen tubes, displaying a
gradient with increasing concentration toward the
tip (Figure 5c). As a control, Figure 5d shows that
a sense probe does not detect transcripts in pollen
tubes. The translocation of ZmMADS2 transcripts into
the pollen tube precedes migration of the vegetative
nucleus and the sperm cells into pollen tubes by
more than two hours. Possibly, ZmMADS2 is required
only later in the germination process. ZmMADS2
transcript levels in microspores are very low
(Figure 4). Maximal expression in mature pollen
after dehiscence and localisation of transcripts in
pollen tubes clearly classify ZmMADS2 as a late
pollen gene. The above findings clearly imply a
function for ZmMADS2 during pollen tube growth, in
particular a function as a transcription factor,
i.e. in regulating the transcription of one or more
target genes.

Example 4: Transient transformation assays 

Pollen transformation was carried out according to
a modified protocol of Hamilton et al. (1992).
Plasmids containing the ZmMADS2 promoter fragments
fused to the luciferase reporter gene and the NOS
terminator were used to coat gold particles (0.3 µm
- 3 µm), which were then bombarded into pollen
grains of Nicotiana tabacum (SR1) using a
helium-driven biolistic device (PDS-1000/He
Biolistic Particle Delivery System, Bio-Rad). The
plasmids contained fragments of SEQ ID No. 3,
Pos. 1-1538 (pDNS5), 455-1538 (pDNS6), 1006-1538
(pDNS7), 1175-1538 (pDNS8) or no promoter (negative
control).
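
The deletion series can be summarised by the length of the promoter fragment each construct carries. The sketch below is illustrative only: it takes the positions quoted above as 1-based and inclusive (an assumption about the numbering convention), and the helper names are hypothetical.

```python
# Construct name -> (first position, last position) as quoted in the text.
# Assumption: positions are 1-based and inclusive.
CONSTRUCTS = {
    "pDNS5": (1, 1538),
    "pDNS6": (455, 1538),
    "pDNS7": (1006, 1538),
    "pDNS8": (1175, 1538),
}


def fragment_length(start: int, end: int) -> int:
    """Number of bases in the 1-based, inclusive interval start..end."""
    return end - start + 1


def extract_fragment(sequence: str, start: int, end: int) -> str:
    """Return the 1-based, inclusive sub-sequence start..end of sequence."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("positions fall outside the sequence")
    return sequence[start - 1:end]


lengths = {name: fragment_length(*pos) for name, pos in CONSTRUCTS.items()}
for name, n_bases in lengths.items():
    print(f"{name}: {n_bases} bp")

# Toy sequence (not from the listing) to show the coordinate conversion.
toy_fragment = extract_fragment("acgtacgtac", 3, 6)
```

The same extract_fragment helper could be applied to the full promoter sequence once it is available as a single string; only the conversion from 1-based positions to a Python slice is shown here.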

Pollen was spread out with a brush on sterile
plastic petri dishes (30 mm x 15 mm) containing
pollen medium (0.01% H3BO3, 10 mM CaCl2, 0.05 mM
KH2PO4, 0.1% yeast extract and 10% sucrose) with
0.6% agarose. Bombardments were done 5-20 min after
spreading of the pollen. For bombardments done with
leaf tissue, tips of young leaves were cut and
placed on petri dishes containing sterile water
with 0.6% agarose. The plates were kept at room
temperature for 12-16 h prior to measurement of
luciferase activity.
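
To make the medium composition concrete, the following sketch converts the stated concentrations into weighed amounts for one batch. Several points are assumptions rather than statements from the text: percentages are read as weight per volume (g per 100 ml), the salts are taken as anhydrous CaCl2 (110.98 g/mol) and KH2PO4 (136.09 g/mol), and the 100 ml batch volume is arbitrary.

```python
# Assumption: anhydrous salts; hydrated forms would need other molar masses.
MOLAR_MASS_G_PER_MOL = {"CaCl2": 110.98, "KH2PO4": 136.09}

# Components given in percent, read here as % (w/v), i.e. g per 100 ml.
PERCENT_W_V = {"H3BO3": 0.01, "yeast extract": 0.1, "sucrose": 10.0, "agarose": 0.6}

# Components given as millimolar concentrations.
MILLIMOLAR = {"CaCl2": 10.0, "KH2PO4": 0.05}


def grams_per_batch(volume_ml: float) -> dict:
    """Grams of each component needed for volume_ml of pollen medium."""
    grams = {}
    for name, percent in PERCENT_W_V.items():
        grams[name] = percent / 100.0 * volume_ml
    litres = volume_ml / 1000.0
    for name, millimolar in MILLIMOLAR.items():
        moles = millimolar / 1000.0 * litres
        grams[name] = moles * MOLAR_MASS_G_PER_MOL[name]
    return grams


# Hypothetical batch size of 100 ml.
recipe = grams_per_batch(100.0)
for name, amount in recipe.items():
    print(f"{name}: {amount:.5f} g")
```

Very small amounts such as the KH2PO4 are normally added from a concentrated stock solution rather than weighed directly; the calculation only gives the target mass.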

Luciferase activity was measured with the
Luciferase Assay Kit (Promega) following the
manufacturer's protocol (Technical Bulletin #TB101)
except for the preparation of the tissues.
Germinated pollen, along with the agarose medium,
was transferred to ice-cold tubes, 0.2 ml of cold
lysis buffer was added and the material was ground
with a manual homogenizer. Leaf tips were
transferred into ice-cold tubes and homogenized in
a Retsch mill (MM 2000), after which 0.2 ml of
lysis buffer was added. All samples were centrifuged
at 10,000 x g for 5 min at 4°C. The supernatants
were assayed for luciferase activity in a
luminometer (Lumat LB 9501, Berthold). Corrections
were made for the volume of liquid remaining in the
agarose pellets. The total number of assayed samples
per construct was 30 (pollen) or 5 (leaves).

Claims

1. A nucleic acid molecule for use in cloning and 
expressing a pollen specific nucleic acid sequence 
in a plant which is selected from the group con- 
sisting of 

(a) the nucleic acid sequence set out in SEQ ID No. 
3 to 12, or a complementary strand thereof 

(b) a nucleic acid sequence which hybridizes to the
nucleic acid sequence set out in (a) or a
complementary strand thereof and

(c) alleles or derivatives of the nucleic acid se- 
quence defined in (a) or (b) or a complementary 
strand thereof. 

2. The nucleic acid molecule of claim 1, which is 
derived from maize. 

3. The nucleic acid molecule of claim 1 or 2, which
is a DNA, cDNA or RNA molecule. 

4. A vector comprising the nucleic acid molecule of 
any one of claims 1 or 2. 

5. The vector of claim 3, which is a bacterial or 
viral vector. 

6. The vector of any one of claims 4 or 5, wherein
the nucleic acid molecule of any one of claims 1 to 
3 is operably linked to a gene of interest. 

7. The vector of claim 6, wherein the gene of
interest is chosen from a gene which confers to a
plant a character of agronomic or industrial
benefit, a gene for resistance to infection by a
virus, a gene conferring resistance to a herbicide
or to an insecticide, a gene whose expression
confers male sterility and a gene being toxic to
pollen tissue.

8. The vector of any one of claims 4 to 7, which 
comprises further regulatory elements directing or 
enhancing the expression of the gene of interest.

9. The vector of any one of claims 4 to 8, wherein 
the further regulatory elements are 5', 3' or 5' and
3' elements.

10. The vector of claim 9, wherein the 3'
regulatory element is the poly A addition sequence of the
NOS gene of Agrobacterium tumefaciens. 

11. The vector according to any one of claims 4 to 
10, which furthermore contains T-DNA, in particular 
the left, the right or both T-DNA borders. 

12. The vector according to claim 11, wherein the 
nucleic acid molecule, optionally in conjunction 
with the gene of interest, is located within the T- 
DNA or adjacent to it. 

13. A host cell containing the vector of any one of
claims 4 to 12 or a cell deriving therefrom. 

14. The host cell of claim 13 which is a plant, 
yeast or bacterial cell, in particular a cell from 
a monocotyledonous or dicotyledonous plant. 

15. A cell culture, preferably a plant cell culture 
comprising a cell according to any one of claims 13 
or 14.

16. A method of genetically modifying a cell by 
transforming a cell with a nucleic acid molecule of 
any one of claims 1 to 3 or a vector according to 
any one of claims 4 to 12, wherein the gene of
interest contained in the vector is expressible in the
cell. 

17. The method of claim 16, wherein the cell is a
plant, bacterial or yeast cell. 

18. The method of claims 16 or 17, wherein the 
transformed cell is regenerated into a
differentiated plant.

19. The method of any one of claims 16 to 18,
wherein the cell is transformed by transfer of nucleic
acid sequences from a bacterium to the cell. 

20. The method of any one of claims 16 to 19,
wherein the cell is transformed by direct uptake of
nucleic acid sequences, by microinjection of
nucleic acid sequences or particle bombardment.

21. A method for isolating a pollen specific gene
and/or pollen specific regulatory element from a 
plant, whereby a nucleic acid sequence of any one 
of claims 1 to 3 is used to screen nucleic acid se- 
quences derived from the plant. 

22. A plant comprising a host cell according to any 
one of claims 13 or 14 or produced according to a 
method according to any one of claims 16 to 20 or 
progeny thereof. 

23. Propagation and harvest material, in particular 
seeds and plant tissue, comprising a host cell ac- 
cording to any one of claims 13 or 14 or derived 
from a plant according to claim 22. 

Transformation of Nicotiana tabacum - transient assays

Expression of luciferase in mature pollen and young leaves directed by deletion
constructs of the ZmMADS2 promoter

<110> Südwestdeutsche Saatzucht Dr. H. R. Späth

<120> Regulatory Sequences for Pollen Specific or Pollen 
Abundant Gene Expression in Plants 

<130> 23597 WO 

<140> 
<141> 

<160> 40 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 1268 
<212> DNA 
<213> Zea mays 

<400> 1 

tctcggctag cttcctcctc tctccctcct cttcgtcttc ctctttcctt ctcccttggg 60 

aaacctgctg cctttgagct ttcttcttcg agtgctccca ccagatctcc tcctccttac 120 

cttctttggc acgttcggcg gcgcgcgcgg agatagatag atcccgccat cgtcgtcgcc 180 

gtccttgctt ccgatcggag ggccacaacc acaacctctc gccccttagc gtgcgcgcgc 240 

gagccagggt caagaagaga gctagctata ggccggagat cgatggggag gggaaagatc 300

gtgatccgca ggatcgataa ctccacgagc cggcaggtga ccttctccaa gcgccggaac 360 

gggatcttca agaaggccaa ggagctcgcc atcctctgcg atgcggaggt cggcctcgtc 420 

atcttctcca gcaccggccg cctctacgag tactctagca ccagcatgaa atcagttata 480 

gatcggtacg gcaaggccaa ggaagagcag caagtcgtcg caaatcccaa ctcggagctt 540 

aagttttggc aaagggaggc agcaagcttg agacaacaac tgcacaactt gcaagaaaat 600 

tatcggcagt tgacgggaga tgatctttct gggctgaatg tcaaagaact gcagtccctg 660 

gagaatcaat tggaaacaag cctgcgtggt gtccgcgcaa agaaggacca tctcttgata 720 

gatgagattc acgatttgaa tcgaaaggca agtttatttc accaagaaaa tacagacttg 780 

tacaataaga tcaacctgat tcgccaagaa aatgatgagt tacataaaaa gatctatgag 840 

actgaaggac caagtggagt taatcgggag tcaccgactc cattcaactt tgcagtagta 900 

gaaaccagag atgttccagt gcaacttgaa ctcagcacac tgccacaaca aaataacatt 960 

gagccatcta ctgctcctaa gctaggattg caattaattc catgaagaag agtaaaactg 1020 

ccgtcttatg atgctgaagg aaactattta ttgtgaagag atgatactca gagaaagaca 1080 

tatttgtggc agggagattt gagatatgaa cttataaatg taatgcaaat aattttcaga 1140 

ctggaatggg gtcgtggaat tcagaggatg attgctttct agttgcattt gatgtttgat 1200 

gagacttttc tcatgtgaaa cgtttattaa aacttcaaaa tggtgcagat gttgctaaaa 1260 



<210> 2 

<211> 240 

<212> PRT 

<213> Zea mays 

<400> 2 

Met Gly Arg Gly Lys Ile Val Ile Arg Arg Ile Asp Asn Ser Thr Ser
  1               5                  10                  15

Arg Gln Val Thr Phe Ser Lys Arg Arg Asn Gly Ile Phe Lys Lys Ala
             20                  25                  30

Lys Glu Leu Ala Ile Leu Cys Asp Ala Glu Val Gly Leu Val Ile Phe
         35                  40                  45

Ser Ser Thr Gly Arg Leu Tyr Glu Tyr Ser Ser Thr Ser Met Lys Ser
     50                  55                  60

Val Ile Asp Arg Tyr Gly Lys Ala Lys Glu Glu Gln Gln Val Val Ala
 65                  70                  75                  80

Asn Pro Asn Ser Glu Leu Lys Phe Trp Gln Arg Glu Ala Ala Ser Leu
                 85                  90                  95

Arg Gln Gln Leu His Asn Leu Gln Glu Asn Tyr Arg Gln Leu Thr Gly
            100                 105                 110

Asp Asp Leu Ser Gly Leu Asn Val Lys Glu Leu Gln Ser Leu Glu Asn
        115                 120                 125

Gln Leu Glu Thr Ser Leu Arg Gly Val Arg Ala Lys Lys Asp His Leu
    130                 135                 140

Leu Ile Asp Glu Ile His Asp Leu Asn Arg Lys Ala Ser Leu Phe His
145                 150                 155                 160

Gln Glu Asn Thr Asp Leu Tyr Asn Lys Ile Asn Leu Ile Arg Gln Glu
                165                 170                 175

Asn Asp Glu Leu His Lys Lys Ile Tyr Glu Thr Glu Gly Pro Ser Gly
            180                 185                 190

Val Asn Arg Glu Ser Pro Thr Pro Phe Asn Phe Ala Val Val Glu Thr
        195                 200                 205

Arg Asp Val Pro Val Gln Leu Glu Leu Ser Thr Leu Pro Gln Gln Asn
    210                 215                 220

Asn Ile Glu Pro Ser Thr Ala Pro Lys Leu Gly Leu Gln Leu Ile Pro
225                 230                 235                 240



<210> 3 
<211> 1502 
<212> DNA 
<213> Zea mays 

<400> 3 



ctgaagccgt cgtgttcctc acgtgcattg gctacaaatg tatgtaacat agtaaatctg 60 
attgaaaagt taaatctcaa taattattat tacgacaaac taaaaaccta ttccatttct 120 
tcgcatatga aaaatacagg tattccatgt aatataggaa caaatgtata atgcggctac 180 
tttgatgaac aatttacaaa tgcattaaaa ggtacgtaac ataataaatc cgtcatgtaa 240 
tgttgtaact atataaaata gcaacataaa tttaattgca actaaaaacc gcctttttct 300 
atgcgaaata atttacacta gcttcatata ggaacagatg gaaaaccaaa tcattaatgt 360 
ctgcattatc atgcatcatt ctggtatgaa gaaagagcgc gtatagccag catgacagat 420 
ccattgctta caacaaacgt tcaaaggtgg taaagtaaag acctcgaccg gaaccattgg 480 
gcccacttgc aaaatttaat ggcgattcta catctaaaat gtcttgtttc aaatataaga 540
caatatggtt gtatctc::gc ttttacccca atgggttcta tcaatgatgg ttgaggtcct 600 
ccattattta ggacgcggtc aagtgcatca acgaatatga tcacttctac actcgacggt 660 
tctcctaaat ttatacagca ttacagttat atatctacag tcatatattt acgatacttt 720 
aaatggaatc agaaatagaa cgcaatgctt aaataaaacc aatgaatctt aaagaaagat 780 
ttggatccag atatagccca gcaacctatg acaacagtat acatataatg ccaatgccaa 840 
catgtctcaa gtgctcctaa aaactatgta cattctccgc cccgtttact caaatacgta 900
aaatagcact gctctacaat aagtacatct attatcacct ataaaacttt ttccagcaac 960
atttacacaa cttgrgcacc atcaacttta acaacaccca caaattagct caaatacgta 1020

acaaagcgcg cccacgggcg cgcccaagcc actagncccc acacagcgaa aacaatacta 1080 

ctatactaga cacaaaacta ggcgtagttc attcatttac cggtactacc tctttaagtc 1140

ataatcaggt gtaccagaag ctactacgcc ctgacggaga caagcacgca caatagaaaa 1200 

gagcgccccc gtgctcagag tgctcgcctc tagtcccgtg gctactacat accagctgcg 1260 

acgatttcta ttttcctcct cttccgggct accatcagca cggccattta ggggcaatga 1320 

gcagcggcca ataaacaaag ccccggggta gccaatcggg tgtcgccgtg cgcgcgcgcg 1380 

cgcgtccggc ctcctgccct cctcccccct tcccttgcca aataaagcaa gtagcagata 1440 

ggggaaagtc tgctcgcaag ttgcatccgc cccgcgccaa gaaaagccat cgttcttccc 1500



<210> 4
<211> 502
<212> DNA
<213> Zea mays

<400> 4

ctgaagccgt cgtgctcctc acgcgcactg gctacaaatg catgtaacat agtaaatctg 60
attgaaaagt taaatctcaa taactattat tacgacaaac caaaaaccta ttccatttct 120
ttgcatatga aaaatacagg tattccatgt aatataggaa caaatgtata atgcggctac 180
tttgatgaac aatttataaa cgcattaaaa ggratgtaac ataataaatc cgtcacgtaa 240
tgttgtaact atataaaaca gcaacacaaa ttcaattgca actaaaaacc gcctttttct 300
atgcgaaata attcacacta gcttcatata ggaatagacg gaaaaccaaa tcattaatgt 360
ccgcattatc atgcattatt ctggtatgaa gaaagagcgc gtatagccag catgacagat 420
ccattgttta caacaaatgt tcaaaggtgg taaagtaaag acctcgaccg gaaccattgg 480
gcccacttgc aaaatttaat gg 502



<210> 5 
<211> 500 
<212> DNA 
<213> Zea mays 



<400> 5 

tgattctaca tctaaaatgt tttgtttcaa atataagaca atatggttgt atctttgctt 60
ttaccccaat gggttctacc aatgatggtt gaggtcctcc attatttagg atgtggtcaa 120
gtgcatcaac gaatatgatc acttctacac tcgacggttc tcctaaattt atacagtatt 180
acagttatat atttatagtt atatatttat gatactttaa acggaaccag aaatagaatg 240
caatgcttaa ataaaaccaa tgaatcttaa agaaagattt ggatccagat atagtccagc 300
aacttatgac aatagtatac atataatgcc aatgccaaca tgtttcaagt gctcttaaaa 360
actatgtaca ttttctgctt cgtttactca aatacgtaaa atagcactgc tctataataa 420
gtacatctat tatcacccat aaaacttttt ccagcaacat ttatataatt tgtgcatcat 480
taattttaac aacacttata 500



<210> 6 

<211> 1000 

<212> DNA 

<213> Zea mays 



cgattctaca tctaaaatgt tttgtttcaa atataagaca atatggttgt atctttgctt 60 
ttaccccaat gggttctatc aatgatggtt gaggtcctcc attatttagg atgtggtcaa 120 
gtgcatcaac gaatatgatc acttctacac tcgacggttc tcctaaattt atacagtatt 180 
acagttatat atttatagtt atatatttat gatactttaa atggaatcag aaatagaatg 240 
caatgcttaa ataaaaccaa tgaatcttaa agaaagattt ggatccagat atagtccagc 300 
aacttatgac aatagtatac atataatgcc aatgccaaca tgtttcaagt gctcttaaaa 360 
actatgtaca ttttctgctt cgtttactca aatacgtaaa atagcactgc tctataataa 420
gtacatctat tatcacctat aaaacttttt ccagcaacat ttatataatt tgtgcatcat 480 
taattttaac aacacttata aattagctca aatacgtaac aaagcgcgcc catgggcgcg 540 
ctcaaactac tagttcttat acagtgaaaa caatactact atattagaca taaaattagg 600 
tgtagcttac ttacctaccg gcaccacttc cttaagtcat aatcaggcgc ar-agaagct 660
actacgccct aaccigagaca agcacgcaca acagaaaaga gcgccctcgt gctcagagtg 720 
ctcgccttta gtttcgtggc tactatatac tagctgcgac gatttttatt ttctttctct 780 
tctgggctat cattagcacg gccatttagg ggtaatgagc agcggccaat aaacaaagcc 840 
ctggggtagc caatcgggtg tcgccgtgcg cgcgcgcgcg cgtctggcct cctgccctcc 900 
tcctcccttt ccttgctaaa taaagcaagt agcagatagg ggaaagtctg ctcgcaagtt 960 
gcatccgccc tgcgccaaga aaagccatcg ttcttcccac 1000



<210> 7 
<211> 740 
<212> DNA 
<213> Zea mays 



<400> 7 

tgaatcttaa agaaagattt ggatccagat atagtccagc aacttatgac aatagtatac 60
atataatgcc aatgccaaca tgtttcaagt gctcttaaaa actatgtaca ttttctgctt 120
cgtttactca aatacgtaaa atagcactgc tctataataa gtacatctat tatcacctat 180
aaaacttttt ccagcaacat ttatataatt tgtgcatcat taattttaac aacacttata 240
aattagctca aatacgtaac aaagcgcgcc catgggcgcg ctcaagctac tagttcttat 300
acagtgaaaa caatactact atattagaca taaaattagg tgtagtttat ttatttactg 360
gtactacttc tttaagtcat aattaggtgt actagaagct actacgccct gacggagaca 420
agcacgcaca atagaaaaga gcgccctcgt gctcagagtg ctcgccttta gtttcgtggc 480
tactatatac tagctgcgac gatttttatt ttctttctct tctgggctat cattagcacg 540
gccatttagg ggtaatgagc agcggccaat aaacaaagcc ctggggtagc caatcgggtg 600
tcgccgtgcg cgcgcgcgcg cgtctggcct cctgccctcc tcctcccttt ccttgctaaa 660
taaagcaagt agcagatagg ggaaagtctg ctcgcaagtt gcatccgccc tgcgccaaga 720
aaagccatcg ttcttcccac 740



<210> 8 
<211> 500 
<212> DNA 
<213> Zea mays 



<400> 8 

aattagctca aatacgtaac aaagcgcgcc catgggcgcg ctcaagctac tagttcttat 60
acagtgaaaa caatactact atattagaca taaaattagg tgtagtttat ttatttactg 120
gtactacttc tttaagtcat aattaggtgt actagaagct actacgccct gacggagaca 180
agcacgcaca atagaaaaga gcgccctcgt gctcagagtg ctcgccttta gtttcgtggc 240
tactatatac tagctgcgac gatttttatt ttctttctct tctgggctat cattagcacg 300
gccatttagg ggtaatgagc agcggccaat aaacaaagcc ctggggtagc caatcgggtg 360
tcgccgtgcg cgcgcgcgcg cgtctggcct cctgccctcc tcctcccttt ccttgctaaa 420
taaagcaagt agcagatagg ggaaagtctg ctcgcaagtt gcatccgccc tgcgccaaga 480
aaagccatcg ttcttcccac 500



<210> 9 
<211> 410 
<212> DNA 
<213> Zea mays 



<400> 9 

taaaattagg tgtagtttat ttatttactg gtactacttc tttaagtcat aattaggtgt 60
actagaagct actacgccct gacggagaca agcacgcaca atagaaaaga gcgccctcgt 120
gctcagagtg ctcgccttta gtttcgtggc tactatatac tagctgcgac gatttttatt 180
ttctttctct tctgggctat cattagcacg gccatttagg ggtaatgagc agcggccaat 240
aaacaaagcc ctggggtagc caatcgggtg tcgccgtgcg cgcgcgcgcg cgtctggcct 300
cctgccctcc tcctcccttt ccttgctaaa taaagcaagt agcagatagg ggaaagtctg 360
ctcgcaagtt gcatccgccc tgcgccaaga aaagccatcg ttcttcccac 410



<210> 10

<211> 310 

<212> DNA 

<213> Zea mays

<400> 10 

aaacgcacac atagaagcat cattcccctc tcggczagcC tcctcctctc tccctcctct 60 

tcgtctcccc ctttccttct cccttgggaa acctgccgcc tttgagcttt cttcttcgag 120 

tgctcccacc agatctcctc ctccttacct tctttggcac gttcggcggc gcgcgcggag 180 

atagatagat cccgccaccg tcgtcgccgt cctcgcttcc gatcggaggg ccacaaccac 240 

aacctctcgc tcctcagcgt gcgcgcgcga gccagggtca agaagagagc tagctatagg 300

ccggagatcg 310



<210> 11 
<211> 251 
<212> DNA 
<213> Zea mays 

<400> 11 

agaagagtaa aactgccgtc ttatgatgct gaaggaaact atttattgtg aagagatgat 60 
actcagagaa agacatattt gtggcaggga gattcgagat atgaacttat aaatgtaatg 120 
caaataattt tcagactgga atggggtcgt ggaattcaga ggatgattgc tttctagttg 180 
catttgatgt ttgatgagac ttttctcacg tgaaacgttt attaaaactt caaaatggtg 240 
cagatgttgc t 251



<210> 12 
<211> 5031 
<212> DNA 
<213> Zea mays 

<220> 

<221> misc_feature
<222> (1813) . . (1994) 
<223> EXON I 

<220> 

<221> misc_feature 
<222> (2871) . . (2954) 
<223> EXON II 

<220> 

<221> misc_feature
<222> (3202) . . (3258) 
<223> EXON III 

<220> 

<221> misc_feature 

<222> (4066) . . (4166) 

<223> EXON IV 

<220> 

<221> misc_feature 
<222> (4248) . . (428S) 
<223> EXON V 

<220> 

<221> misc_feature 
<222> (4367) . . (4449) 
<223> EXON VI 

<220>

<221> misc_feature
<222> (4523) . . (4676) 
<223> EXON VII 

<220> 

<400> 12 

ctgaagctgt cgcgttcctc acgtgcattg gctacaaatg tatgtaacat agtaaatctg 60
attgaaaagt taaatcccaa taattattat tacgacaaac taaaaaccta ttccatttct 120
ttgcatatga aaaatacagg tattccatgc aatataggaa caaatgtata atgcggctac 180
tttgatgaac aatttataaa tgcattaaaa ggtatgtaac ataataaatc cgtcatgtaa 240
tgttgtaact atataaaaca gcaacataaa tttaattgca actaaaaacc gcctttttct 300
atgcgaaata atttacacta gcttcatata ggaatagatg gaaaaccaaa tcattaatgt 360
ctgcattatc atgcatratt ctggtatgaa gaaagagcgc gtatagccag catgacagat 420
ccattgttta caacaaatgt tcaaaggtgg taaagtaaag acctcgaccg gaaccattgg 480
gcccacttgc aaaatttaat ggtgattcta catctaaaat gttttgtttc aaatataaga 540
caatatggtt gtatcttcgc ttttacccca atgggttcta tcaatgatgg ttgaggtcct 600
ccattattta ggatgtggcc aagtgcatca acgaatatga tcacttctac actcgacggt 660
tctcctaaat ttatacagca ttacagttat atatttatag ttatatattt atgatacttt 720
aaatggaatc agaaatagaa tgcaatgctt aaataaaacc aatgaatctt aaagaaagac 780
ttggatccag atatagticca gcaacttatg acaatagtat acatataatg ccaatgccaa 840
catgttccaa gtgctcttaa aaactatgta cattttctgc ttcgtttact caaatacgta 900
aaatagcact gctctacaat aagtacatct attatcacct ataaaacttt ttccagcaac 960
atttatataa tttgtgcatc attaatttta acaacactta taaattagct caaatacgta 1020
acaaagcgcg cccatgggcg cgctcaagct actagttctt atacagtgaa aacaatacta 1080
ctatattaga cataaaacta ggtgtagttt atttatttac tggtactact tctttaagtc 1140
ataattaggt gtactagaag ctactacgcc ctgacggaga caagcacgca caatagaaaa 1200
gagcgccctc gcgctcagag tgctcgcctt tagtttcgtg gctactatat actagctgcg 1260
acgattttta ttttctccct cttctgggct atcattagca cggccattta ggggtaatga 1320
gcagcggcca acaaacaaag ccctggggta gccaatcggg tgtcgccgtg cgcgcgcgcg 1380
cgcgtctggc ctcctgccct cctcctccct ttccttgcta aataaagcaa gtagcagata 1440
ggggaaagtc tgctcgcaag ttgcatccgc cctgcgccaa gaaaagccat cgttcttccc 1500
acaaacgcac acatagaagc atcattcccc tctcggctag cttcctcctc tctccctcct 1560
cttcgtcttc ctctttcctt ctcccttggg aaacctgctg cctttgagct ttcttcttcg 1620
agtgctccca ccagatcccc tcctccttac cttctttggc acgttcggcg gcgcgcgcgg 1680
agatagatag atcccgccat cgtcgtcgcc gtccttgctt ccgatcggag ggccacaacc 1740
acaacctctc gctccccagc gtgcgcgcgc gagccagggt caagaagaga gctagctata 1800
ggccggagat cgatggggag gggaaagatc gtgatccgca ggatcgataa ctccacgagc 1860
cggcaggtga ccttctccaa gcgccggaac gggatcttca agaaggccaa ggagctcgcc 1920
atcctctgcg atgcggaggt cggcctcgtc atcttctcca gcaccggccg cctctacgag 1980
tactctagca ccaggtcgtc gcccttcgat tattatccgt ccctgcttta attaatttaa 2040
tacttcaata atgccaactg ttttcttgct cctgttgaag cgatatacta tatataataa 2100
tagagttgtt cttggcctct tgctttaatt agtttgggtt atcgattgca ttttatgatt 2160
tgaaactaca cactacctgt ctcacataca tggactggcg agagtgatta tggacctata 2220
caccatgtat acacaagccc taaggtagac aaaagatttt aatttacctg tcgattgtcg 2280
aatattgact gtacatacag cgcacagagc tagcaacaac actcctggtt tctttgacgg 2340
ggcaagcaag acctgtgtat gcggtctaac aataaaatac aatctttggt acggttagta 2400
tttgtgtttc tgcccctcct cttccttgtc gtttggtcta tttgaactat tattcctctc 2460
caaaaaattc cctatgtatc atcggaattc atactcatcc cttgcattgt gtgccatata 2520
ttgagtggca tatagtaggg aacatatgca aaactaaatt taattagctt tgtcatattg 2580
tttcatgtga tcaactcgac acagcatgtc attgcagtag gaaacgcata ggcttttgtg 2640
atgtaagata tatgtgcgta tatatatgca tggggttcta tccacagggg ctggtagtgt 2700
tactgccatc agagagtttt tccctagaga gagagagaga gagagcctag atcctaactc 2760
ctgtaattaa gattttaact ttcttagata gctataaact taatatttta caacgaatag 2820
catagtaata gtttttccct gggataaacc gacacttgtg ctgtgtgcag catgaaatca 2880
gttatagatc ggtacggcaa ggccaaggaa gagcagcaag tcgtcgcaaa tcccaactcg 2940
gagcttaagt tctgctctag ataatttttt ctctctattc tgaagtgttt gcttgtttaa 3000
ctaaaagatc gccatacctt ttttgctctc gttttgatat atatatcctc taagtttatg 3060
gttttcaagg acgggaagaa actacacaac catggtcaca tcaacaagcg tcccacctct 3120
ttttatcnta tccagccgct tgatctcctc tagttaccag cccgggccgr cgaccacgcg 3180
tgccctacag cgagtcgcaa cgcaaaggga ggcagcaagc ttgagacaac aactgcacaa 3240
cttgcaagaa aattatcgca tggctcgaaa accctgcaaa agcttcaagc aagctattca 3300
ttagaaactt gagtgttcag agacagaagc acaatgcaca agcatgcaac acccaaacgc 3360
aggtacaact attacacact catagataac agaaacaatt agcaagagtc aaattatcat 3420
gtgaaaggct agacattaga catcaaaata cttcatacga gataagttga acatgtgaaa 3480
attattctac ggccaacaac aaaactaaaa caaaacaggg atatcacagt taaactactt 3540
cccctgcttc attttaaagg tgcatttgca tttgcatttg aacaagtttt taaggctgtc 3600
tctaacagct accctatcac atcccctatc tcacctcata ttacaaactt cattcagcaa 3660
acagtgtaat ctagagtaca gtgcaaatcc tctattttac acaatcttgc cgcggatagc 3720
ctaacattgt actttgacca tgaatttctc ttatccaata aatccaaacc cagaaaattc 3780
aatgttgttt caaaagcctt gtgagatatg aataatgata caattttcat acactaagac 3840
tgcatataat tcgattaacc gttagacaaa ttctgtaaaa ttaaacggcc ttttttccat 3900
acaaaagaaa cagagggtat atcaaaacat atcctgatga tgctccaaac atttatagag 3960
aaaacacaca catatatcgg caattggcaa acataagcta gactgtgtgt gtataattgt 4020
ataatacaca acattaatcg ctgatgccta ttaatttgtt acagggcagt tgacgggaga 4080
tgatctttct gggctgaatg tcaaagaacc gcagtccctg gagaatcaat tggaaacaag 4140
cctgcgtggt gtccgcgcaa agaaggtttg attcttaatc aaatttctat aatcaattgt 4200
tcaataaatt actaaagatc gttactactg ttgatgctct taacaggacc acctcttgat 4260
agatgagatt cacgatttga atcgaaaggt ctgtaatatc gtgaatttat agttatgtgc 4320
tttatttgat ttagtagcac tgactaacca ctttgttggt cacaggcaag tttatttcac 4380
caagaaaata cagacttgta caataagatc aacctgattc gccaagaaaa tgatgagtta 4440
cataaaaagg tagctgatga attgaacaaa aataccacta actgcctaac cacacatatg 4500
acatgatatt aattgttctc agatctatga gactgaagga ccaagtggag ttaatcggga 4560
gtcaccgact ccattcaact ttgcagtagt agaaaccaga gatgttccag zgcaacttga 4620
actcagcaca ctgccacaac aaaataacat tgagccatct actgctccta agctagggta 4680
taacatttat tctctctata tatttcaatt ggttaactag tgttgactac agtgctataa 4740
tgagttgcta aacacatgca gattgcaatt aattccatga agaagagtaa aactgccgtc 4800
ttatgatgct gaaggaaact atttattgtg aagagatgat actcagagaa agacatattt 4860
gtggcaggga gatttgagat atgaacttat aaatgtaatg caaataattt tcagactgga 4920
atggggtcgt ggaattcaga ggatgattgc tttctagttg catttgatgt ttgatgagac 4980
ttttctcatg tgaaacgttt attaaaactt caaaatggtg cagatgttgc t 5031



<210> 13 

<211> 20 

<212> DNA 

<213> Zea mays 

<400> 13 

acggggaggg gaaggattga 



<210> 14 

<211> 21 

<212> DNA 

<213> Zea mays 

<400> 14

ctgttgttgg cgtactcgta g 



<210> 15 
<211> 21 
<212> DNA 
<213> Zea mays 

<400> 15 

aggggcaaga tcgacatcaa g 



<210> 16 
<211> 19 
<212> DNA 
<213> Zea mays

<400> 16

gkcgwacccg tagaggcgg 19



<210> 17 
<211> 20 
<212> DNA 
<213> Zea mays

<400> 17 

atggggaggg gacgmgttga 



<210> 18 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 18

gctgccgaac tcgtagagct 20



<210> 19 
<211> 21 
<212> DNA 
<213> Zea mays 

<400> 19

gttgttggcg tactcgtaga g 21



<210> 20 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 20

gggcgcaagg tacagctgaa 20



<210> 21 
<211> 21 
<212> DNA 
<213> Zea mays 

<400> 21 

gttgttggcg tactcgtaga g 



<210> 22 

<211> 21 

<212> DNA 

<213> Zea mays 

<400> 22 

aagggcaaga ctgagatcaa g 



<210> 23 
<211> 23 
<212> DNA
<213> Zea mays 

<400> 23

cacttgaact ctttcacgcc tat 23



<210> 24 
<211> 23 
<212> DNA 
<213> Zea mays 

<400> 24

gacaatcttg acacatgtat gaa 23



<210> 25 
<211> 22 
<212> DNA 
<213> Zea mays 

<400> 25

cgattcaaat cgtgaatctc at 22



<210> 26 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 26

cccccgggct gcaggaattc 20



<210> 27 
<211> 22 
<212> DNA 
<213> Zea mays 

<400> 27

ttttagcaac atctgcacca tt 22



<210> 28 
<211> 19 
<212> DNA 
<213> Zea mays 

<400> 28 



tctcggctag cttcctcct 



<210> 29 
<211> 30 
<212> DNA 
<213> Zea mays 

<400> 29 



cggcctatag ctagctctct tcttgaccct 



<210> 30 
<211> 27
<212> DNA 
<213> Zea mays 

<400> 30 

agggtcaaga agagagctag ccatagg 27 



<210> 31 

<211> 28 

<212> DNA 

<213> Zea mays 

<400> 31 

gctaaggagc gagaggrtgt ggttgtgg 



<210> 32 
<211> 26 
<212> DNA 
<213> Zea mays 

<400> 32 

ccacaaccac aacctctcgc tcctta



<210> 33 

<211> 19 

<212> DNA 

<213> Zea mays 

<400> 33 

tcggcagttg acgggagac 



<210> 34 

<211> 20 

<212> DNA 

<213> Zea mays 

<400> 34 

ttagcaactc attatagcac 



<210> 35 

<211> 20 

<212> DNA 

<213> Zea mays 

<400> 35 

agaaaccaga gatgttccag 



<210> 36 

<211> 21 

<212> DNA 

<213> Zea mays 



<400> 36 

caacatctgc accattttga a



<210> 37
<211> 23 
<212> DNA 
<213> Zea mays 

<400> 37 

gtcacaccaa caagcgccct acc 23



<210> 38 
<211> 23 
<212> DNA 
<213> Zea mays 

<400> 38

taagagcatc aacagtagta acg 23



<210> 39 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 39

agaaaccaga gatgttccag 20



<210> 40 
<211> 21 
<212> DNA 
<213> Zea mays 



<400> 40 

caacatctgc accattttga a 